Dataset statistics
| Number of variables | 47 |
|---|---|
| Number of observations | 2156232 |
| Missing cells | 23985531 |
| Missing cells (%) | 23.7% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 773.2 MiB |
| Average record size in memory | 376.0 B |
Variable types
| Numeric | 11 |
|---|---|
| DateTime | 4 |
| Categorical | 11 |
| Text | 20 |
| Unsupported | 1 |
Incident Zip is highly overall correlated with BBL and 7 other fields | High correlation |
BBL is highly overall correlated with Incident Zip and 8 other fields | High correlation |
X Coordinate (State Plane) is highly overall correlated with Incident Zip and 5 other fields | High correlation |
Y Coordinate (State Plane) is highly overall correlated with BBL and 6 other fields | High correlation |
Latitude is highly overall correlated with BBL and 6 other fields | High correlation |
Longitude is highly overall correlated with Incident Zip and 5 other fields | High correlation |
Zip Codes is highly overall correlated with Incident Zip and 8 other fields | High correlation |
Police Precincts is highly overall correlated with Incident Zip and 6 other fields | High correlation |
Agency is highly overall correlated with Agency Name and 3 other fields | High correlation |
Agency Name is highly overall correlated with Agency and 3 other fields | High correlation |
Facility Type is highly overall correlated with Incident Zip and 3 other fields | High correlation |
Borough is highly overall correlated with BBL and 9 other fields | High correlation |
Open Data Channel Type is highly overall correlated with Facility Type | High correlation |
Park Borough is highly overall correlated with BBL and 9 other fields | High correlation |
Vehicle Type is highly overall correlated with Incident Zip and 2 other fields | High correlation |
Taxi Company Borough is highly overall correlated with Incident Zip and 12 other fields | High correlation |
Borough Boundaries is highly overall correlated with BBL and 9 other fields | High correlation |
Address Type is highly imbalanced (77.4%) | Imbalance |
Facility Type is highly imbalanced (59.3%) | Imbalance |
Status is highly imbalanced (82.3%) | Imbalance |
Vehicle Type is highly imbalanced (78.6%) | Imbalance |
Closed Date has 163693 (7.6%) missing values | Missing |
Descriptor has 30750 (1.4%) missing values | Missing |
Location Type has 256708 (11.9%) missing values | Missing |
Incident Zip has 27020 (1.3%) missing values | Missing |
Incident Address has 80531 (3.7%) missing values | Missing |
Street Name has 80593 (3.7%) missing values | Missing |
Cross Street 1 has 539647 (25.0%) missing values | Missing |
Cross Street 2 has 538899 (25.0%) missing values | Missing |
Intersection Street 1 has 635953 (29.5%) missing values | Missing |
Intersection Street 2 has 634646 (29.4%) missing values | Missing |
City has 119922 (5.6%) missing values | Missing |
Landmark has 797021 (37.0%) missing values | Missing |
Facility Type has 2018905 (93.6%) missing values | Missing |
Due Date has 2148125 (99.6%) missing values | Missing |
Resolution Description has 55803 (2.6%) missing values | Missing |
Resolution Action Updated Date has 51376 (2.4%) missing values | Missing |
BBL has 261151 (12.1%) missing values | Missing |
X Coordinate (State Plane) has 33541 (1.6%) missing values | Missing |
Y Coordinate (State Plane) has 32919 (1.5%) missing values | Missing |
Vehicle Type has 2155736 (> 99.9%) missing values | Missing |
Taxi Company Borough has 2155022 (99.9%) missing values | Missing |
Taxi Pick Up Location has 2132427 (98.9%) missing values | Missing |
Bridge Highway Name has 2140247 (99.3%) missing values | Missing |
Bridge Highway Direction has 2147873 (99.6%) missing values | Missing |
Road Ramp has 2151397 (99.8%) missing values | Missing |
Bridge Highway Segment has 2140241 (99.3%) missing values | Missing |
Latitude has 33616 (1.6%) missing values | Missing |
Longitude has 33616 (1.6%) missing values | Missing |
Location has 33616 (1.6%) missing values | Missing |
Zip Codes has 42923 (2.0%) missing values | Missing |
Community Districts has 34126 (1.6%) missing values | Missing |
Borough Boundaries has 34131 (1.6%) missing values | Missing |
City Council Districts has 34126 (1.6%) missing values | Missing |
Police Precincts has 34126 (1.6%) missing values | Missing |
Request Closing Time has 163693 (7.6%) missing values | Missing |
Unique Key has unique values | Unique |
Request Closing Time is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
Reproduction
| Analysis started | 2023-11-08 16:07:57.294059 |
|---|---|
| Analysis finished | 2023-11-08 16:11:13.748262 |
| Duration | 3 minutes and 16.45 seconds |
| Software version | ydata-profiling vv4.6.1 |
| Download configuration | config.json |
Unique Key
Real number (ℝ)
UNIQUE 
| Distinct | 2156232 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 58193077 |
| Minimum | 57020601 |
|---|---|
| Maximum | 59353779 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 16.5 MiB |
Quantile statistics
| Minimum | 57020601 |
|---|---|
| 5-th percentile | 57142676 |
| Q1 | 57609028 |
| median | 58194522 |
| Q3 | 58776800 |
| 95-th percentile | 59239322 |
| Maximum | 59353779 |
| Range | 2333178 |
| Interquartile range (IQR) | 1167772.5 |
Descriptive statistics
| Standard deviation | 672836.97 |
|---|---|
| Coefficient of variation (CV) | 0.011562148 |
| Kurtosis | -1.2036443 |
| Mean | 58193077 |
| Median Absolute Deviation (MAD) | 583831 |
| Skewness | -0.0041783373 |
| Sum | 1.2547777 × 1014 |
| Variance | 4.5270958 × 1011 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 59348005 | 1 | < 0.1% |
| 57804588 | 1 | < 0.1% |
| 57804295 | 1 | < 0.1% |
| 57802015 | 1 | < 0.1% |
| 57801375 | 1 | < 0.1% |
| 57802607 | 1 | < 0.1% |
| 57802125 | 1 | < 0.1% |
| 57802445 | 1 | < 0.1% |
| 57801568 | 1 | < 0.1% |
| 57800520 | 1 | < 0.1% |
| Other values (2156222) | 2156222 |
| Value | Count | Frequency (%) |
| 57020601 | 1 | |
| 57020602 | 1 | |
| 57020603 | 1 | |
| 57020606 | 1 | |
| 57020608 | 1 | |
| 57020611 | 1 | |
| 57020612 | 1 | |
| 57020613 | 1 | |
| 57020614 | 1 | |
| 57020622 | 1 |
| Value | Count | Frequency (%) |
| 59353779 | 1 | |
| 59353778 | 1 | |
| 59353777 | 1 | |
| 59353776 | 1 | |
| 59353775 | 1 | |
| 59353774 | 1 | |
| 59353773 | 1 | |
| 59353772 | 1 | |
| 59353771 | 1 | |
| 59353721 | 1 |
Created Date
Date
| Distinct | 1768322 |
|---|---|
| Distinct (%) | 82.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 16.5 MiB |
| Minimum | 2023-03-12 18:51:46 |
|---|---|
| Maximum | 2023-11-07 12:00:00 |
Closed Date
Date
MISSING 
| Distinct | 1489653 |
|---|---|
| Distinct (%) | 74.8% |
| Missing | 163693 |
| Missing (%) | 7.6% |
| Memory size | 16.5 MiB |
| Minimum | 2022-11-30 17:42:00 |
|---|---|
| Maximum | 2023-11-08 22:00:00 |
Agency
Categorical
HIGH CORRELATION 
| Distinct | 16 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 16.5 MiB |
| NYPD | |
|---|---|
| HPD | |
| DSNY | |
| DOT | |
| DEP | |
| Other values (11) |
Length
| Max length | 44 |
|---|---|
| Median length | 4 |
| Mean length | 3.6197107 |
| Min length | 3 |
Characters and Unicode
| Total characters | 7804936 |
|---|---|
| Distinct characters | 21 |
| Distinct categories | 2 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | DSNY |
|---|---|
| 2nd row | DSNY |
| 3rd row | DSNY |
| 4th row | DOT |
| 5th row | NYPD |
Common Values
| Value | Count | Frequency (%) |
| NYPD | 974274 | |
| HPD | 376614 | 17.5% |
| DSNY | 217308 | 10.1% |
| DOT | 124456 | 5.8% |
| DEP | 117798 | 5.5% |
| DPR | 105190 | 4.9% |
| DOB | 68736 | 3.2% |
| DOHMH | 59988 | 2.8% |
| DHS | 37152 | 1.7% |
| EDC | 35460 | 1.6% |
| Other values (6) | 39256 | 1.8% |
Length
| Value | Count | Frequency (%) |
| nypd | 974274 | |
| hpd | 376614 | 17.5% |
| dsny | 217308 | 10.1% |
| dot | 124456 | 5.8% |
| dep | 117798 | 5.5% |
| dpr | 105190 | 4.9% |
| dob | 68736 | 3.2% |
| dohmh | 59988 | 2.8% |
| dhs | 37152 | 1.7% |
| edc | 35460 | 1.6% |
| Other values (11) | 41016 | 1.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| D | 2132359 | |
| P | 1584830 | |
| N | 1192990 | |
| Y | 1191582 | |
| H | 533742 | 6.8% |
| O | 255993 | 3.3% |
| S | 254812 | 3.3% |
| E | 156015 | 2.0% |
| T | 150089 | 1.9% |
| R | 106950 | 1.4% |
| Other values (11) | 245574 | 3.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 7803176 | |
| Space Separator | 1760 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| D | 2132359 | |
| P | 1584830 | |
| N | 1192990 | |
| Y | 1191582 | |
| H | 533742 | 6.8% |
| O | 255993 | 3.3% |
| S | 254812 | 3.3% |
| E | 156015 | 2.0% |
| T | 150089 | 1.9% |
| R | 106950 | 1.4% |
| Other values (10) | 243814 | 3.1% |
Space Separator
| Value | Count | Frequency (%) |
| 1760 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 7803176 | |
| Common | 1760 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| D | 2132359 | |
| P | 1584830 | |
| N | 1192990 | |
| Y | 1191582 | |
| H | 533742 | 6.8% |
| O | 255993 | 3.3% |
| S | 254812 | 3.3% |
| E | 156015 | 2.0% |
| T | 150089 | 1.9% |
| R | 106950 | 1.4% |
| Other values (10) | 243814 | 3.1% |
Common
| Value | Count | Frequency (%) |
| 1760 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 7804936 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| D | 2132359 | |
| P | 1584830 | |
| N | 1192990 | |
| Y | 1191582 | |
| H | 533742 | 6.8% |
| O | 255993 | 3.3% |
| S | 254812 | 3.3% |
| E | 156015 | 2.0% |
| T | 150089 | 1.9% |
| R | 106950 | 1.4% |
| Other values (11) | 245574 | 3.1% |
Agency Name
Categorical
HIGH CORRELATION 
| Distinct | 16 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 16.5 MiB |
| New York City Police Department | |
|---|---|
| Department of Housing Preservation and Development | |
| Department of Sanitation | |
| Department of Transportation | |
| Department of Environmental Protection | |
| Other values (11) |
Length
| Max length | 50 |
|---|---|
| Median length | 44 |
| Mean length | 33.986435 |
| Min length | 23 |
Characters and Unicode
| Total characters | 73282638 |
|---|---|
| Distinct characters | 40 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Department of Sanitation |
|---|---|
| 2nd row | Department of Sanitation |
| 3rd row | Department of Sanitation |
| 4th row | Department of Transportation |
| 5th row | New York City Police Department |
Common Values
| Value | Count | Frequency (%) |
| New York City Police Department | 974274 | |
| Department of Housing Preservation and Development | 376614 | 17.5% |
| Department of Sanitation | 217308 | 10.1% |
| Department of Transportation | 124456 | 5.8% |
| Department of Environmental Protection | 117798 | 5.5% |
| Department of Parks and Recreation | 105190 | 4.9% |
| Department of Buildings | 68736 | 3.2% |
| Department of Health and Mental Hygiene | 59988 | 2.8% |
| Department of Homeless Services | 37152 | 1.7% |
| Economic Development Corporation | 35460 | 1.6% |
| Other values (6) | 39256 | 1.8% |
Length
| Value | Count | Frequency (%) |
| department | 2096195 | |
| of | 1121977 | |
| new | 974274 | |
| city | 974274 | |
| police | 974274 | |
| york | 974274 | |
| and | 576267 | 5.7% |
| development | 412074 | 4.1% |
| preservation | 376614 | 3.7% |
| housing | 376614 | 3.7% |
| Other values (26) | 1294006 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 9023739 | |
| 7994611 | ||
| t | 7274805 | 9.9% |
| o | 5494299 | 7.5% |
| n | 5431853 | 7.4% |
| r | 4674873 | 6.4% |
| a | 4245224 | 5.8% |
| i | 4044210 | 5.5% |
| m | 2785572 | 3.8% |
| p | 2668537 | 3.6% |
| Other values (30) | 19644915 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 56835428 | |
| Uppercase Letter | 8452599 | 11.5% |
| Space Separator | 7994611 | 10.9% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 9023739 | |
| t | 7274805 | |
| o | 5494299 | |
| n | 5431853 | |
| r | 4674873 | |
| a | 4245224 | |
| i | 4044210 | |
| m | 2785572 | 4.9% |
| p | 2668537 | 4.7% |
| l | 1730418 | 3.0% |
| Other values (12) | 9461898 |
Uppercase Letter
| Value | Count | Frequency (%) |
| D | 2508621 | |
| P | 1584126 | |
| C | 1048289 | |
| N | 974274 | 11.5% |
| Y | 974274 | 11.5% |
| H | 533742 | 6.3% |
| S | 254460 | 3.0% |
| E | 154255 | 1.8% |
| T | 148681 | 1.8% |
| R | 105190 | 1.2% |
| Other values (7) | 166687 | 2.0% |
Space Separator
| Value | Count | Frequency (%) |
| 7994611 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 65288027 | |
| Common | 7994611 | 10.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 9023739 | |
| t | 7274805 | |
| o | 5494299 | 8.4% |
| n | 5431853 | 8.3% |
| r | 4674873 | 7.2% |
| a | 4245224 | 6.5% |
| i | 4044210 | 6.2% |
| m | 2785572 | 4.3% |
| p | 2668537 | 4.1% |
| D | 2508621 | 3.8% |
| Other values (29) | 17136294 |
Common
| Value | Count | Frequency (%) |
| 7994611 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 73282638 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 9023739 | |
| 7994611 | ||
| t | 7274805 | 9.9% |
| o | 5494299 | 7.5% |
| n | 5431853 | 7.4% |
| r | 4674873 | 6.4% |
| a | 4245224 | 5.8% |
| i | 4044210 | 5.5% |
| m | 2785572 | 3.8% |
| p | 2668537 | 3.6% |
| Other values (30) | 19644915 |
Complaint Type
Text
| Distinct | 192 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 16.5 MiB |
Length
| Max length | 39 |
|---|---|
| Median length | 31 |
| Mean length | 16.263301 |
| Min length | 4 |
Characters and Unicode
| Total characters | 35067451 |
|---|---|
| Distinct characters | 56 |
| Distinct categories | 7 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 7 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Derelict Vehicles |
|---|---|
| 2nd row | Derelict Vehicles |
| 3rd row | Derelict Vehicles |
| 4th row | Street Condition |
| 5th row | Panhandling |
| Value | Count | Frequency (%) |
| noise | 497496 | 10.7% |
| 469661 | 10.1% | |
| illegal | 358185 | 7.7% |
| parking | 319523 | 6.9% |
| condition | 232766 | 5.0% |
| residential | 215403 | 4.6% |
| water | 155275 | 3.3% |
| street/sidewalk | 125895 | 2.7% |
| blocked | 108159 | 2.3% |
| driveway | 108159 | 2.3% |
| Other values (249) | 2071336 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 3660737 | 10.4% |
| i | 2814143 | 8.0% |
| 2505626 | 7.1% | |
| l | 2209444 | 6.3% |
| a | 1833101 | 5.2% |
| n | 1710817 | 4.9% |
| o | 1652360 | 4.7% |
| t | 1539141 | 4.4% |
| r | 1400510 | 4.0% |
| s | 1286839 | 3.7% |
| Other values (46) | 14454733 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 23291573 | |
| Uppercase Letter | 8386045 | 23.9% |
| Space Separator | 2505626 | 7.1% |
| Dash Punctuation | 491995 | 1.4% |
| Other Punctuation | 378574 | 1.1% |
| Open Punctuation | 6819 | < 0.1% |
| Close Punctuation | 6819 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 3660737 | |
| i | 2814143 | |
| l | 2209444 | |
| a | 1833101 | |
| n | 1710817 | |
| o | 1652360 | 7.1% |
| t | 1539141 | 6.6% |
| r | 1400510 | 6.0% |
| s | 1286839 | 5.5% |
| g | 929577 | 4.0% |
| Other values (16) | 4254904 |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 1006834 | |
| I | 785935 | 9.4% |
| T | 683240 | 8.1% |
| A | 670063 | 8.0% |
| S | 638768 | 7.6% |
| R | 620717 | 7.4% |
| P | 584643 | 7.0% |
| C | 482809 | 5.8% |
| E | 455104 | 5.4% |
| D | 441842 | 5.3% |
| Other values (15) | 2016090 |
Space Separator
| Value | Count | Frequency (%) |
| 2505626 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 491995 |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 378574 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 6819 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 6819 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 31677618 | |
| Common | 3389833 | 9.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 3660737 | 11.6% |
| i | 2814143 | 8.9% |
| l | 2209444 | 7.0% |
| a | 1833101 | 5.8% |
| n | 1710817 | 5.4% |
| o | 1652360 | 5.2% |
| t | 1539141 | 4.9% |
| r | 1400510 | 4.4% |
| s | 1286839 | 4.1% |
| N | 1006834 | 3.2% |
| Other values (41) | 12563692 |
Common
| Value | Count | Frequency (%) |
| 2505626 | ||
| - | 491995 | 14.5% |
| / | 378574 | 11.2% |
| ( | 6819 | 0.2% |
| ) | 6819 | 0.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 35067451 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 3660737 | 10.4% |
| i | 2814143 | 8.0% |
| 2505626 | 7.1% | |
| l | 2209444 | 6.3% |
| a | 1833101 | 5.2% |
| n | 1710817 | 4.9% |
| o | 1652360 | 4.7% |
| t | 1539141 | 4.4% |
| r | 1400510 | 4.0% |
| s | 1286839 | 3.7% |
| Other values (46) | 14454733 |
Descriptor
Text
MISSING 
| Distinct | 954 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 30750 |
| Missing (%) | 1.4% |
| Memory size | 16.5 MiB |
Length
| Max length | 80 |
|---|---|
| Median length | 66 |
| Mean length | 16.874297 |
| Min length | 3 |
Characters and Unicode
| Total characters | 35866014 |
|---|---|
| Distinct characters | 73 |
| Distinct categories | 8 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 51 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Derelict Vehicles |
|---|---|
| 2nd row | Derelict Vehicles |
| 3rd row | Derelict Vehicles |
| 4th row | Pothole |
| 5th row | N/A |
| Value | Count | Frequency (%) |
| loud | 320032 | 6.3% |
| music/party | 274081 | 5.4% |
| blocked | 169062 | 3.4% |
| parking | 112167 | 2.2% |
| hydrant | 111193 | 2.2% |
| access | 108161 | 2.1% |
| 99031 | 2.0% | |
| no | 89701 | 1.8% |
| violation | 83299 | 1.7% |
| sign | 77292 | 1.5% |
| Other values (1258) | 3597335 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2916534 | 8.1% | |
| e | 2423526 | 6.8% |
| i | 2149973 | 6.0% |
| o | 1896746 | 5.3% |
| a | 1801732 | 5.0% |
| r | 1741036 | 4.9% |
| n | 1730264 | 4.8% |
| t | 1690081 | 4.7% |
| s | 1404691 | 3.9% |
| c | 1253910 | 3.5% |
| Other values (63) | 16857521 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 23081875 | |
| Uppercase Letter | 8628320 | 24.1% |
| Space Separator | 2916534 | 8.1% |
| Other Punctuation | 686172 | 1.9% |
| Open Punctuation | 164488 | 0.5% |
| Close Punctuation | 164488 | 0.5% |
| Dash Punctuation | 114140 | 0.3% |
| Decimal Number | 109997 | 0.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 2423526 | |
| i | 2149973 | 9.3% |
| o | 1896746 | 8.2% |
| a | 1801732 | 7.8% |
| r | 1741036 | 7.5% |
| n | 1730264 | 7.5% |
| t | 1690081 | 7.3% |
| s | 1404691 | 6.1% |
| c | 1253910 | 5.4% |
| d | 1214120 | 5.3% |
| Other values (16) | 5775796 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 920445 | 10.7% |
| L | 768660 | 8.9% |
| N | 558815 | 6.5% |
| B | 538023 | 6.2% |
| S | 533975 | 6.2% |
| A | 525541 | 6.1% |
| T | 499315 | 5.8% |
| E | 482146 | 5.6% |
| R | 460168 | 5.3% |
| O | 451534 | 5.2% |
| Other values (16) | 2889698 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 42557 | |
| 2 | 20730 | |
| 3 | 16181 | 14.7% |
| 4 | 12668 | 11.5% |
| 0 | 8694 | 7.9% |
| 5 | 6207 | 5.6% |
| 8 | 1109 | 1.0% |
| 9 | 1063 | 1.0% |
| 6 | 526 | 0.5% |
| 7 | 262 | 0.2% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 622212 | |
| : | 36783 | 5.4% |
| , | 25678 | 3.7% |
| . | 1362 | 0.2% |
| & | 121 | < 0.1% |
| " | 10 | < 0.1% |
| * | 6 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 2916534 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 164488 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 164488 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 114140 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 31710195 | |
| Common | 4155819 | 11.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 2423526 | 7.6% |
| i | 2149973 | 6.8% |
| o | 1896746 | 6.0% |
| a | 1801732 | 5.7% |
| r | 1741036 | 5.5% |
| n | 1730264 | 5.5% |
| t | 1690081 | 5.3% |
| s | 1404691 | 4.4% |
| c | 1253910 | 4.0% |
| d | 1214120 | 3.8% |
| Other values (42) | 14404116 |
Common
| Value | Count | Frequency (%) |
| 2916534 | ||
| / | 622212 | 15.0% |
| ( | 164488 | 4.0% |
| ) | 164488 | 4.0% |
| - | 114140 | 2.7% |
| 1 | 42557 | 1.0% |
| : | 36783 | 0.9% |
| , | 25678 | 0.6% |
| 2 | 20730 | 0.5% |
| 3 | 16181 | 0.4% |
| Other values (11) | 32028 | 0.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 35866014 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2916534 | 8.1% | |
| e | 2423526 | 6.8% |
| i | 2149973 | 6.0% |
| o | 1896746 | 5.3% |
| a | 1801732 | 5.0% |
| r | 1741036 | 4.9% |
| n | 1730264 | 4.8% |
| t | 1690081 | 4.7% |
| s | 1404691 | 3.9% |
| c | 1253910 | 3.5% |
| Other values (63) | 16857521 |
Location Type
Text
MISSING 
| Distinct | 147 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 256708 |
| Missing (%) | 11.9% |
| Memory size | 16.5 MiB |
Length
| Max length | 36 |
|---|---|
| Median length | 30 |
| Mean length | 15.615429 |
| Min length | 3 |
Characters and Unicode
| Total characters | 29661882 |
|---|---|
| Distinct characters | 58 |
| Distinct categories | 9 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 3 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Street |
|---|---|
| 2nd row | Street |
| 3rd row | Street |
| 4th row | Subway |
| 5th row | Street/Sidewalk |
| Value | Count | Frequency (%) |
| street/sidewalk | 695397 | |
| residential | 607588 | |
| building | 404790 | |
| street | 250836 | 9.4% |
| building/house | 227096 | 8.5% |
| sidewalk | 99034 | 3.7% |
| store/commercial | 37038 | 1.4% |
| above | 35458 | 1.3% |
| address | 35458 | 1.3% |
| family | 29291 | 1.1% |
| Other values (166) | 238878 | 9.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 3656769 | 12.3% |
| t | 2270212 | 7.7% |
| S | 2172541 | 7.3% |
| i | 1902191 | 6.4% |
| I | 1506784 | 5.1% |
| l | 1434948 | 4.8% |
| d | 1384470 | 4.7% |
| a | 1293105 | 4.4% |
| r | 1217254 | 4.1% |
| / | 1046003 | 3.5% |
| Other values (48) | 11777605 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 17681607 | |
| Uppercase Letter | 10078370 | |
| Other Punctuation | 1060120 | 3.6% |
| Space Separator | 761340 | 2.6% |
| Decimal Number | 37863 | 0.1% |
| Math Symbol | 20777 | 0.1% |
| Dash Punctuation | 8863 | < 0.1% |
| Open Punctuation | 6471 | < 0.1% |
| Close Punctuation | 6471 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 3656769 | |
| t | 2270212 | |
| i | 1902191 | |
| l | 1434948 | 8.1% |
| d | 1384470 | 7.8% |
| a | 1293105 | 7.3% |
| r | 1217254 | 6.9% |
| k | 840891 | 4.8% |
| w | 824925 | 4.7% |
| s | 616029 | 3.5% |
| Other values (14) | 2240813 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 2172541 | |
| I | 1506784 | |
| D | 770036 | 7.6% |
| E | 759656 | 7.5% |
| L | 756334 | 7.5% |
| N | 753488 | 7.5% |
| B | 693234 | 6.9% |
| R | 639255 | 6.3% |
| A | 469106 | 4.7% |
| U | 384181 | 3.8% |
| Other values (13) | 1173755 |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 1046003 | |
| . | 14115 | 1.3% |
| ' | 2 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 20842 | |
| 1 | 8543 | |
| 2 | 8478 |
Space Separator
| Value | Count | Frequency (%) |
| 761340 |
Math Symbol
| Value | Count | Frequency (%) |
| + | 20777 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 8863 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 6471 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 6471 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 27759977 | |
| Common | 1901905 | 6.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 3656769 | 13.2% |
| t | 2270212 | 8.2% |
| S | 2172541 | 7.8% |
| i | 1902191 | 6.9% |
| I | 1506784 | 5.4% |
| l | 1434948 | 5.2% |
| d | 1384470 | 5.0% |
| a | 1293105 | 4.7% |
| r | 1217254 | 4.4% |
| k | 840891 | 3.0% |
| Other values (37) | 10080812 |
Common
| Value | Count | Frequency (%) |
| / | 1046003 | |
| 761340 | ||
| 3 | 20842 | 1.1% |
| + | 20777 | 1.1% |
| . | 14115 | 0.7% |
| - | 8863 | 0.5% |
| 1 | 8543 | 0.4% |
| 2 | 8478 | 0.4% |
| ( | 6471 | 0.3% |
| ) | 6471 | 0.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 29661882 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 3656769 | 12.3% |
| t | 2270212 | 7.7% |
| S | 2172541 | 7.3% |
| i | 1902191 | 6.4% |
| I | 1506784 | 5.1% |
| l | 1434948 | 4.8% |
| d | 1384470 | 4.7% |
| a | 1293105 | 4.4% |
| r | 1217254 | 4.1% |
| / | 1046003 | 3.5% |
| Other values (48) | 11777605 |
Incident Zip
Real number (ℝ)
HIGH CORRELATION  MISSING 
| Distinct | 348 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 27020 |
| Missing (%) | 1.3% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 10824.947 |
| Minimum | 83 |
|---|---|
| Maximum | 98057 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 16.5 MiB |
Quantile statistics
| Minimum | 83 |
|---|---|
| 5-th percentile | 10014 |
| Q1 | 10314 |
| median | 11204 |
| Q3 | 11235 |
| 95-th percentile | 11421 |
| Maximum | 98057 |
| Range | 97974 |
| Interquartile range (IQR) | 921 |
Descriptive statistics
| Standard deviation | 582.63775 |
|---|---|
| Coefficient of variation (CV) | 0.053823611 |
| Kurtosis | 1792.3152 |
| Mean | 10824.947 |
| Median Absolute Deviation (MAD) | 216 |
| Skewness | 14.265663 |
| Sum | 2.3048608 × 1010 |
| Variance | 339466.75 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 11226 | 32214 | 1.5% |
| 11385 | 30213 | 1.4% |
| 10467 | 29812 | 1.4% |
| 11201 | 29122 | 1.4% |
| 10452 | 28966 | 1.3% |
| 10468 | 28517 | 1.3% |
| 10456 | 27647 | 1.3% |
| 10457 | 27632 | 1.3% |
| 11207 | 27550 | 1.3% |
| 11208 | 25684 | 1.2% |
| Other values (338) | 1841855 | |
| (Missing) | 27020 | 1.3% |
| Value | Count | Frequency (%) |
| 83 | 2 | |
| 2062 | 2 | |
| 3833 | 1 | |
| 7002 | 1 | |
| 7017 | 1 | |
| 7052 | 1 | |
| 7072 | 1 | |
| 7080 | 1 | |
| 7083 | 1 | |
| 7104 | 1 |
| Value | Count | Frequency (%) |
| 98057 | 2 | |
| 95834 | 1 | |
| 94804 | 1 | |
| 91302 | 1 | |
| 84117 | 1 | |
| 82001 | 1 | |
| 78758 | 2 | |
| 75007 | 1 | |
| 75001 | 1 | |
| 60604 | 1 |
Incident Address
Text
MISSING 
| Distinct | 451975 |
|---|---|
| Distinct (%) | 21.8% |
| Missing | 80531 |
| Missing (%) | 3.7% |
| Memory size | 16.5 MiB |
Length
| Max length | 88 |
|---|---|
| Median length | 51 |
| Mean length | 17.810209 |
| Min length | 1 |
Characters and Unicode
| Total characters | 36968668 |
|---|---|
| Distinct characters | 73 |
| Distinct categories | 9 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 218095 ? |
|---|---|
| Unique (%) | 10.5% |
Sample
| 1st row | 585 BRISTOL STREET |
|---|---|
| 2nd row | 2362 EAST 13 STREET |
| 3rd row | 34 HILLSIDE AVENUE |
| 4th row | CRESCENT STREET |
| 5th row | 637 EAST 230 STREET |
| Value | Count | Frequency (%) |
| street | 903176 | 13.7% |
| avenue | 797956 | 12.1% |
| east | 206310 | 3.1% |
| west | 174541 | 2.6% |
| boulevard | 73339 | 1.1% |
| place | 69854 | 1.1% |
| road | 67883 | 1.0% |
| park | 31620 | 0.5% |
| broadway | 29493 | 0.4% |
| parkway | 25542 | 0.4% |
| Other values (33245) | 4235393 |
Most occurring characters
| Value | Count | Frequency (%) |
| 5157816 | ||
| E | 4881406 | 13.2% |
| T | 2759937 | 7.5% |
| A | 2117757 | 5.7% |
| R | 1930669 | 5.2% |
| S | 1838185 | 5.0% |
| 1 | 1714844 | 4.6% |
| N | 1620544 | 4.4% |
| 2 | 1114775 | 3.0% |
| U | 1107601 | 3.0% |
| Other values (63) | 12725134 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 22739083 | |
| Decimal Number | 8618360 | 23.3% |
| Space Separator | 5157816 | 14.0% |
| Dash Punctuation | 447491 | 1.2% |
| Lowercase Letter | 4854 | < 0.1% |
| Other Punctuation | 1059 | < 0.1% |
| Close Punctuation | 2 | < 0.1% |
| Open Punctuation | 2 | < 0.1% |
| Math Symbol | 1 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 4881406 | |
| T | 2759937 | |
| A | 2117757 | |
| R | 1930669 | 8.5% |
| S | 1838185 | 8.1% |
| N | 1620544 | 7.1% |
| U | 1107601 | 4.9% |
| V | 981905 | 4.3% |
| O | 907247 | 4.0% |
| L | 655855 | 2.9% |
| Other values (16) | 3937977 |
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 640 | |
| t | 482 | |
| r | 474 | |
| a | 467 | |
| n | 378 | 7.8% |
| o | 361 | 7.4% |
| s | 281 | 5.8% |
| d | 242 | 5.0% |
| i | 196 | 4.0% |
| l | 189 | 3.9% |
| Other values (16) | 1144 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 1714844 | |
| 2 | 1114775 | |
| 0 | 890258 | |
| 3 | 870816 | |
| 5 | 843257 | |
| 4 | 790347 | |
| 6 | 649003 | 7.5% |
| 7 | 624882 | 7.3% |
| 8 | 590150 | 6.8% |
| 9 | 530028 | 6.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 638 | |
| / | 393 | |
| . | 14 | 1.3% |
| , | 11 | 1.0% |
| @ | 2 | 0.2% |
| # | 1 | 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 5157816 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 447491 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 2 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 2 |
Math Symbol
| Value | Count | Frequency (%) |
| + | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 22743937 | |
| Common | 14224731 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| E | 4881406 | |
| T | 2759937 | |
| A | 2117757 | |
| R | 1930669 | 8.5% |
| S | 1838185 | 8.1% |
| N | 1620544 | 7.1% |
| U | 1107601 | 4.9% |
| V | 981905 | 4.3% |
| O | 907247 | 4.0% |
| L | 655855 | 2.9% |
| Other values (42) | 3942831 |
Common
| Value | Count | Frequency (%) |
| 5157816 | ||
| 1 | 1714844 | 12.1% |
| 2 | 1114775 | 7.8% |
| 0 | 890258 | 6.3% |
| 3 | 870816 | 6.1% |
| 5 | 843257 | 5.9% |
| 4 | 790347 | 5.6% |
| 6 | 649003 | 4.6% |
| 7 | 624882 | 4.4% |
| 8 | 590150 | 4.1% |
| Other values (11) | 978583 | 6.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 36968668 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 5157816 | ||
| E | 4881406 | 13.2% |
| T | 2759937 | 7.5% |
| A | 2117757 | 5.7% |
| R | 1930669 | 5.2% |
| S | 1838185 | 5.0% |
| 1 | 1714844 | 4.6% |
| N | 1620544 | 4.4% |
| 2 | 1114775 | 3.0% |
| U | 1107601 | 3.0% |
| Other values (63) | 12725134 |
Street Name
Text
MISSING 
| Distinct | 11264 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 80593 |
| Missing (%) | 3.7% |
| Memory size | 16.5 MiB |
Length
| Max length | 70 |
|---|---|
| Median length | 47 |
| Mean length | 13.410296 |
| Min length | 2 |
Characters and Unicode
| Total characters | 27834934 |
|---|---|
| Distinct characters | 73 |
| Distinct categories | 9 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1982 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | BRISTOL STREET |
|---|---|
| 2nd row | EAST 13 STREET |
| 3rd row | HILLSIDE AVENUE |
| 4th row | CRESCENT STREET |
| 5th row | EAST 230 STREET |
| Value | Count | Frequency (%) |
| street | 903174 | 19.4% |
| avenue | 797952 | 17.1% |
| east | 206292 | 4.4% |
| west | 174540 | 3.7% |
| boulevard | 73339 | 1.6% |
| place | 69854 | 1.5% |
| road | 67883 | 1.5% |
| park | 31613 | 0.7% |
| broadway | 29491 | 0.6% |
| parkway | 25542 | 0.5% |
| Other values (5884) | 2287334 |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 4880887 | |
| 3209785 | ||
| T | 2759873 | |
| A | 2113179 | 7.6% |
| R | 1930523 | 6.9% |
| S | 1838126 | 6.6% |
| N | 1620409 | 5.8% |
| U | 1107591 | 4.0% |
| V | 981881 | 3.5% |
| O | 907168 | 3.3% |
| Other values (63) | 6485512 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 22731109 | |
| Space Separator | 3209785 | 11.5% |
| Decimal Number | 1889374 | 6.8% |
| Lowercase Letter | 3843 | < 0.1% |
| Other Punctuation | 687 | < 0.1% |
| Dash Punctuation | 131 | < 0.1% |
| Open Punctuation | 2 | < 0.1% |
| Close Punctuation | 2 | < 0.1% |
| Math Symbol | 1 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 4880887 | |
| T | 2759873 | |
| A | 2113179 | |
| R | 1930523 | 8.5% |
| S | 1838126 | 8.1% |
| N | 1620409 | 7.1% |
| U | 1107591 | 4.9% |
| V | 981881 | 4.3% |
| O | 907168 | 4.0% |
| L | 655844 | 2.9% |
| Other values (16) | 3935628 |
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 553 | |
| t | 400 | |
| r | 344 | 9.0% |
| a | 340 | 8.8% |
| o | 293 | 7.6% |
| n | 292 | 7.6% |
| s | 231 | 6.0% |
| d | 181 | 4.7% |
| i | 156 | 4.1% |
| u | 154 | 4.0% |
| Other values (16) | 899 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 445566 | |
| 2 | 225880 | |
| 3 | 188638 | |
| 4 | 169179 | 9.0% |
| 5 | 160877 | 8.5% |
| 7 | 159842 | 8.5% |
| 6 | 146560 | 7.8% |
| 8 | 146120 | 7.7% |
| 9 | 131451 | 7.0% |
| 0 | 115261 | 6.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 638 | |
| / | 31 | 4.5% |
| . | 9 | 1.3% |
| , | 7 | 1.0% |
| # | 1 | 0.1% |
| @ | 1 | 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 3209785 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 131 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 2 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 2 |
Math Symbol
| Value | Count | Frequency (%) |
| + | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 22734952 | |
| Common | 5099982 | 18.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| E | 4880887 | |
| T | 2759873 | |
| A | 2113179 | |
| R | 1930523 | 8.5% |
| S | 1838126 | 8.1% |
| N | 1620409 | 7.1% |
| U | 1107591 | 4.9% |
| V | 981881 | 4.3% |
| O | 907168 | 4.0% |
| L | 655844 | 2.9% |
| Other values (42) | 3939471 |
Common
| Value | Count | Frequency (%) |
| 3209785 | ||
| 1 | 445566 | 8.7% |
| 2 | 225880 | 4.4% |
| 3 | 188638 | 3.7% |
| 4 | 169179 | 3.3% |
| 5 | 160877 | 3.2% |
| 7 | 159842 | 3.1% |
| 6 | 146560 | 2.9% |
| 8 | 146120 | 2.9% |
| 9 | 131451 | 2.6% |
| Other values (11) | 116084 | 2.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 27834934 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| E | 4880887 | |
| 3209785 | ||
| T | 2759873 | |
| A | 2113179 | 7.6% |
| R | 1930523 | 6.9% |
| S | 1838126 | 6.6% |
| N | 1620409 | 5.8% |
| U | 1107591 | 4.0% |
| V | 981881 | 3.5% |
| O | 907168 | 3.3% |
| Other values (63) | 6485512 |
Cross Street 1
Text
MISSING 
| Distinct | 15985 |
|---|---|
| Distinct (%) | 1.0% |
| Missing | 539647 |
| Missing (%) | 25.0% |
| Memory size | 16.5 MiB |
Length
| Max length | 33 |
|---|---|
| Median length | 30 |
| Mean length | 12.717315 |
| Min length | 2 |
Characters and Unicode
| Total characters | 20558620 |
|---|---|
| Distinct characters | 65 |
| Distinct categories | 8 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 2450 ? |
|---|---|
| Unique (%) | 0.2% |
Sample
| 1st row | LOTT AVENUE |
|---|---|
| 2nd row | GRAVESEND NECK ROAD |
| 3rd row | BOGARDUS PLACE |
| 4th row | 23 AVENUE |
| 5th row | CARPENTER AVENUE |
| Value | Count | Frequency (%) |
| avenue | 687841 | 19.0% |
| street | 531633 | 14.7% |
| east | 126373 | 3.5% |
| west | 99332 | 2.7% |
| road | 58886 | 1.6% |
| st | 57238 | 1.6% |
| place | 53827 | 1.5% |
| ave | 47567 | 1.3% |
| boulevard | 46448 | 1.3% |
| broadway | 23276 | 0.6% |
| Other values (5642) | 1891794 |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 3573253 | |
| 2316211 | ||
| T | 1774615 | 8.6% |
| A | 1716435 | 8.3% |
| N | 1299697 | 6.3% |
| R | 1290307 | 6.3% |
| S | 1187987 | 5.8% |
| U | 915040 | 4.5% |
| V | 861292 | 4.2% |
| O | 641208 | 3.1% |
| Other values (55) | 4982575 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 16759002 | |
| Space Separator | 2316211 | 11.3% |
| Decimal Number | 1476750 | 7.2% |
| Dash Punctuation | 3228 | < 0.1% |
| Other Punctuation | 3079 | < 0.1% |
| Lowercase Letter | 272 | < 0.1% |
| Open Punctuation | 39 | < 0.1% |
| Close Punctuation | 39 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 3573253 | |
| T | 1774615 | |
| A | 1716435 | |
| N | 1299697 | 7.8% |
| R | 1290307 | 7.7% |
| S | 1187987 | 7.1% |
| U | 915040 | 5.5% |
| V | 861292 | 5.1% |
| O | 641208 | 3.8% |
| L | 530024 | 3.2% |
| Other values (16) | 2969144 |
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 48 | |
| i | 46 | |
| x | 35 | |
| l | 24 | |
| r | 19 | 7.0% |
| e | 18 | 6.6% |
| v | 13 | 4.8% |
| n | 10 | 3.7% |
| a | 10 | 3.7% |
| d | 8 | 2.9% |
| Other values (11) | 41 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 341635 | |
| 2 | 161860 | |
| 3 | 141481 | |
| 8 | 134632 | 9.1% |
| 4 | 124402 | 8.4% |
| 5 | 124068 | 8.4% |
| 7 | 123654 | 8.4% |
| 6 | 115896 | 7.8% |
| 9 | 105203 | 7.1% |
| 0 | 103919 | 7.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| ? | 1716 | |
| / | 810 | |
| ' | 521 | 16.9% |
| & | 32 | 1.0% |
Space Separator
| Value | Count | Frequency (%) |
| 2316211 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 3228 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 39 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 39 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 16759274 | |
| Common | 3799346 | 18.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| E | 3573253 | |
| T | 1774615 | |
| A | 1716435 | |
| N | 1299697 | 7.8% |
| R | 1290307 | 7.7% |
| S | 1187987 | 7.1% |
| U | 915040 | 5.5% |
| V | 861292 | 5.1% |
| O | 641208 | 3.8% |
| L | 530024 | 3.2% |
| Other values (37) | 2969416 |
Common
| Value | Count | Frequency (%) |
| 2316211 | ||
| 1 | 341635 | 9.0% |
| 2 | 161860 | 4.3% |
| 3 | 141481 | 3.7% |
| 8 | 134632 | 3.5% |
| 4 | 124402 | 3.3% |
| 5 | 124068 | 3.3% |
| 7 | 123654 | 3.3% |
| 6 | 115896 | 3.1% |
| 9 | 105203 | 2.8% |
| Other values (8) | 110304 | 2.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 20558620 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| E | 3573253 | |
| 2316211 | ||
| T | 1774615 | 8.6% |
| A | 1716435 | 8.3% |
| N | 1299697 | 6.3% |
| R | 1290307 | 6.3% |
| S | 1187987 | 5.8% |
| U | 915040 | 4.5% |
| V | 861292 | 4.2% |
| O | 641208 | 3.1% |
| Other values (55) | 4982575 |
Cross Street 2
Text
MISSING 
| Distinct | 15959 |
|---|---|
| Distinct (%) | 1.0% |
| Missing | 538899 |
| Missing (%) | 25.0% |
| Memory size | 16.5 MiB |
Length
| Max length | 33 |
|---|---|
| Median length | 29 |
| Mean length | 12.970895 |
| Min length | 1 |
Characters and Unicode
| Total characters | 20978257 |
|---|---|
| Distinct characters | 67 |
| Distinct categories | 8 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 2466 ? |
|---|---|
| Unique (%) | 0.2% |
Sample
| 1st row | HEGEMAN AVENUE |
|---|---|
| 2nd row | AVENUE X |
| 3rd row | ELLWOOD STREET |
| 4th row | DITMARS BOULEVARD |
| 5th row | LOWERRE PLACE |
| Value | Count | Frequency (%) |
| avenue | 662056 | 18.1% |
| street | 548494 | 15.0% |
| east | 131492 | 3.6% |
| west | 104848 | 2.9% |
| road | 63742 | 1.7% |
| boulevard | 51986 | 1.4% |
| place | 49884 | 1.4% |
| st | 48659 | 1.3% |
| ave | 45191 | 1.2% |
| broadway | 19585 | 0.5% |
| Other values (5527) | 1929583 |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 3613685 | |
| 2378467 | ||
| T | 1828504 | 8.7% |
| A | 1706950 | 8.1% |
| R | 1354117 | 6.5% |
| N | 1299169 | 6.2% |
| S | 1233871 | 5.9% |
| U | 901351 | 4.3% |
| V | 861329 | 4.1% |
| O | 686275 | 3.3% |
| Other values (57) | 5114539 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 17122060 | |
| Space Separator | 2378467 | 11.3% |
| Decimal Number | 1469159 | 7.0% |
| Dash Punctuation | 4271 | < 0.1% |
| Other Punctuation | 3909 | < 0.1% |
| Lowercase Letter | 311 | < 0.1% |
| Open Punctuation | 40 | < 0.1% |
| Close Punctuation | 40 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 3613685 | |
| T | 1828504 | |
| A | 1706950 | |
| R | 1354117 | 7.9% |
| N | 1299169 | 7.6% |
| S | 1233871 | 7.2% |
| U | 901351 | 5.3% |
| V | 861329 | 5.0% |
| O | 686275 | 4.0% |
| L | 536211 | 3.1% |
| Other values (16) | 3100598 |
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 59 | |
| i | 41 | |
| x | 35 | |
| e | 21 | 6.8% |
| r | 17 | 5.5% |
| l | 16 | 5.1% |
| o | 16 | 5.1% |
| a | 14 | 4.5% |
| k | 12 | 3.9% |
| n | 12 | 3.9% |
| Other values (13) | 68 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 357946 | |
| 2 | 166316 | |
| 3 | 136011 | 9.3% |
| 7 | 128621 | 8.8% |
| 4 | 123469 | 8.4% |
| 8 | 122564 | 8.3% |
| 5 | 114301 | 7.8% |
| 6 | 111245 | 7.6% |
| 9 | 105475 | 7.2% |
| 0 | 103211 | 7.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 3038 | |
| ' | 504 | 12.9% |
| ? | 364 | 9.3% |
| & | 3 | 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 2378467 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 4271 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 40 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 40 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 17122371 | |
| Common | 3855886 | 18.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| E | 3613685 | |
| T | 1828504 | |
| A | 1706950 | |
| R | 1354117 | 7.9% |
| N | 1299169 | 7.6% |
| S | 1233871 | 7.2% |
| U | 901351 | 5.3% |
| V | 861329 | 5.0% |
| O | 686275 | 4.0% |
| L | 536211 | 3.1% |
| Other values (39) | 3100909 |
Common
| Value | Count | Frequency (%) |
| 2378467 | ||
| 1 | 357946 | 9.3% |
| 2 | 166316 | 4.3% |
| 3 | 136011 | 3.5% |
| 7 | 128621 | 3.3% |
| 4 | 123469 | 3.2% |
| 8 | 122564 | 3.2% |
| 5 | 114301 | 3.0% |
| 6 | 111245 | 2.9% |
| 9 | 105475 | 2.7% |
| Other values (8) | 111471 | 2.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 20978257 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| E | 3613685 | |
| 2378467 | ||
| T | 1828504 | 8.7% |
| A | 1706950 | 8.1% |
| R | 1354117 | 6.5% |
| N | 1299169 | 6.2% |
| S | 1233871 | 5.9% |
| U | 901351 | 4.3% |
| V | 861329 | 4.1% |
| O | 686275 | 3.3% |
| Other values (57) | 5114539 |
MISSING 
| Distinct | 10177 |
|---|---|
| Distinct (%) | 0.7% |
| Missing | 635953 |
| Missing (%) | 29.5% |
| Memory size | 16.5 MiB |
Length
| Max length | 41 |
|---|---|
| Median length | 30 |
| Mean length | 12.993419 |
| Min length | 3 |
Characters and Unicode
| Total characters | 19753622 |
|---|---|
| Distinct characters | 61 |
| Distinct categories | 8 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1322 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | CARPENTER AVENUE |
|---|---|
| 2nd row | WEST 16 STREET |
| 3rd row | AVENUE X |
| 4th row | AMSTERDAM AVENUE |
| 5th row | MCCLELLAN STREET |
| Value | Count | Frequency (%) |
| avenue | 691312 | 20.3% |
| street | 533551 | 15.7% |
| east | 126781 | 3.7% |
| west | 100046 | 2.9% |
| road | 59347 | 1.7% |
| place | 53615 | 1.6% |
| boulevard | 48982 | 1.4% |
| broadway | 22952 | 0.7% |
| park | 21927 | 0.6% |
| 5 | 20242 | 0.6% |
| Other values (5274) | 1729822 |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 3486229 | |
| 2212158 | ||
| T | 1721138 | 8.7% |
| A | 1641329 | 8.3% |
| N | 1263288 | 6.4% |
| R | 1256520 | 6.4% |
| S | 1134391 | 5.7% |
| U | 913367 | 4.6% |
| V | 814043 | 4.1% |
| O | 614901 | 3.1% |
| Other values (51) | 4696258 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 16151747 | |
| Space Separator | 2212158 | 11.2% |
| Decimal Number | 1385143 | 7.0% |
| Dash Punctuation | 2980 | < 0.1% |
| Other Punctuation | 1383 | < 0.1% |
| Lowercase Letter | 207 | < 0.1% |
| Close Punctuation | 2 | < 0.1% |
| Open Punctuation | 2 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 3486229 | |
| T | 1721138 | |
| A | 1641329 | |
| N | 1263288 | 7.8% |
| R | 1256520 | 7.8% |
| S | 1134391 | 7.0% |
| U | 913367 | 5.7% |
| V | 814043 | 5.0% |
| O | 614901 | 3.8% |
| L | 498477 | 3.1% |
| Other values (16) | 2808064 |
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 29 | |
| u | 24 | |
| n | 24 | |
| t | 22 | |
| d | 20 | |
| b | 17 | |
| s | 16 | |
| e | 11 | 5.3% |
| a | 10 | 4.8% |
| r | 6 | 2.9% |
| Other values (8) | 28 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 319227 | |
| 2 | 151930 | |
| 3 | 133753 | |
| 8 | 126625 | 9.1% |
| 5 | 116230 | 8.4% |
| 7 | 115634 | 8.3% |
| 4 | 115187 | 8.3% |
| 6 | 109416 | 7.9% |
| 9 | 99451 | 7.2% |
| 0 | 97690 | 7.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 930 | |
| ' | 421 | |
| & | 32 | 2.3% |
Space Separator
| Value | Count | Frequency (%) |
| 2212158 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2980 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 2 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 16151954 | |
| Common | 3601668 | 18.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| E | 3486229 | |
| T | 1721138 | |
| A | 1641329 | |
| N | 1263288 | 7.8% |
| R | 1256520 | 7.8% |
| S | 1134391 | 7.0% |
| U | 913367 | 5.7% |
| V | 814043 | 5.0% |
| O | 614901 | 3.8% |
| L | 498477 | 3.1% |
| Other values (34) | 2808271 |
Common
| Value | Count | Frequency (%) |
| 2212158 | ||
| 1 | 319227 | 8.9% |
| 2 | 151930 | 4.2% |
| 3 | 133753 | 3.7% |
| 8 | 126625 | 3.5% |
| 5 | 116230 | 3.2% |
| 7 | 115634 | 3.2% |
| 4 | 115187 | 3.2% |
| 6 | 109416 | 3.0% |
| 9 | 99451 | 2.8% |
| Other values (7) | 102057 | 2.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 19753622 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| E | 3486229 | |
| 2212158 | ||
| T | 1721138 | 8.7% |
| A | 1641329 | 8.3% |
| N | 1263288 | 6.4% |
| R | 1256520 | 6.4% |
| S | 1134391 | 5.7% |
| U | 913367 | 4.6% |
| V | 814043 | 4.1% |
| O | 614901 | 3.1% |
| Other values (51) | 4696258 |
MISSING 
| Distinct | 10501 |
|---|---|
| Distinct (%) | 0.7% |
| Missing | 634646 |
| Missing (%) | 29.4% |
| Memory size | 16.5 MiB |
Length
| Max length | 37 |
|---|---|
| Median length | 30 |
| Mean length | 13.26327 |
| Min length | 3 |
Characters and Unicode
| Total characters | 20181206 |
|---|---|
| Distinct characters | 60 |
| Distinct categories | 8 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1416 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | LOWERRE PLACE |
|---|---|
| 2nd row | PRIVATE CATANZARO SQUARE |
| 3rd row | AVENUE Y |
| 4th row | BROADWAY |
| 5th row | EAST 167 STREET |
| Value | Count | Frequency (%) |
| avenue | 656512 | 19.0% |
| street | 562824 | 16.3% |
| east | 135741 | 3.9% |
| west | 108557 | 3.1% |
| road | 63167 | 1.8% |
| boulevard | 52905 | 1.5% |
| place | 49763 | 1.4% |
| drive | 19029 | 0.6% |
| broadway | 18392 | 0.5% |
| parkway | 18034 | 0.5% |
| Other values (5382) | 1763351 |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 3536797 | |
| 2280702 | ||
| T | 1802375 | 8.9% |
| A | 1620638 | 8.0% |
| R | 1324238 | 6.6% |
| N | 1249806 | 6.2% |
| S | 1195936 | 5.9% |
| U | 885956 | 4.4% |
| V | 803988 | 4.0% |
| O | 654836 | 3.2% |
| Other values (50) | 4825934 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 16491303 | |
| Space Separator | 2280702 | 11.3% |
| Decimal Number | 1401426 | 6.9% |
| Dash Punctuation | 4084 | < 0.1% |
| Other Punctuation | 3492 | < 0.1% |
| Lowercase Letter | 153 | < 0.1% |
| Open Punctuation | 23 | < 0.1% |
| Close Punctuation | 23 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 3536797 | |
| T | 1802375 | |
| A | 1620638 | |
| R | 1324238 | 8.0% |
| N | 1249806 | 7.6% |
| S | 1195936 | 7.3% |
| U | 885956 | 5.4% |
| V | 803988 | 4.9% |
| O | 654836 | 4.0% |
| L | 502203 | 3.0% |
| Other values (16) | 2914530 |
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 29 | |
| i | 22 | |
| e | 20 | |
| x | 17 | |
| s | 8 | 5.2% |
| r | 8 | 5.2% |
| v | 8 | 5.2% |
| l | 8 | 5.2% |
| a | 8 | 5.2% |
| n | 6 | 3.9% |
| Other values (8) | 19 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 341300 | |
| 2 | 157458 | |
| 3 | 130002 | 9.3% |
| 7 | 122383 | 8.7% |
| 4 | 117117 | 8.4% |
| 8 | 116715 | 8.3% |
| 5 | 109728 | 7.8% |
| 6 | 106365 | 7.6% |
| 9 | 101381 | 7.2% |
| 0 | 98977 | 7.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 3061 | |
| ' | 431 | 12.3% |
Space Separator
| Value | Count | Frequency (%) |
| 2280702 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 4084 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 23 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 23 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 16491456 | |
| Common | 3689750 | 18.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| E | 3536797 | |
| T | 1802375 | |
| A | 1620638 | |
| R | 1324238 | 8.0% |
| N | 1249806 | 7.6% |
| S | 1195936 | 7.3% |
| U | 885956 | 5.4% |
| V | 803988 | 4.9% |
| O | 654836 | 4.0% |
| L | 502203 | 3.0% |
| Other values (34) | 2914683 |
Common
| Value | Count | Frequency (%) |
| 2280702 | ||
| 1 | 341300 | 9.2% |
| 2 | 157458 | 4.3% |
| 3 | 130002 | 3.5% |
| 7 | 122383 | 3.3% |
| 4 | 117117 | 3.2% |
| 8 | 116715 | 3.2% |
| 5 | 109728 | 3.0% |
| 6 | 106365 | 2.9% |
| 9 | 101381 | 2.7% |
| Other values (6) | 106599 | 2.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 20181206 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| E | 3536797 | |
| 2280702 | ||
| T | 1802375 | 8.9% |
| A | 1620638 | 8.0% |
| R | 1324238 | 6.6% |
| N | 1249806 | 6.2% |
| S | 1195936 | 5.9% |
| U | 885956 | 4.4% |
| V | 803988 | 4.0% |
| O | 654836 | 3.2% |
| Other values (50) | 4825934 |
Address Type
Categorical
IMBALANCE 
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 11412 |
| Missing (%) | 0.5% |
| Memory size | 16.5 MiB |
| ADDRESS | |
|---|---|
| INTERSECTION | 152190 |
| BLOCKFACE | 23440 |
| UNRECOGNIZED | 15827 |
| PLACENAME | 1461 |
Length
| Max length | 12 |
|---|---|
| Median length | 7 |
| Mean length | 7.4149006 |
| Min length | 7 |
Characters and Unicode
| Total characters | 15903627 |
|---|---|
| Distinct characters | 19 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | ADDRESS |
|---|---|
| 2nd row | ADDRESS |
| 3rd row | ADDRESS |
| 4th row | BLOCKFACE |
| 5th row | ADDRESS |
Common Values
| Value | Count | Frequency (%) |
| ADDRESS | 1951902 | |
| INTERSECTION | 152190 | 7.1% |
| BLOCKFACE | 23440 | 1.1% |
| UNRECOGNIZED | 15827 | 0.7% |
| PLACENAME | 1461 | 0.1% |
| (Missing) | 11412 | 0.5% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| address | 1951902 | |
| intersection | 152190 | 7.1% |
| blockface | 23440 | 1.1% |
| unrecognized | 15827 | 0.7% |
| placename | 1461 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| S | 4055994 | |
| D | 3919631 | |
| E | 2314298 | |
| R | 2119919 | |
| A | 1978264 | |
| N | 337495 | 2.1% |
| I | 320207 | 2.0% |
| T | 304380 | 1.9% |
| C | 216358 | 1.4% |
| O | 191457 | 1.2% |
| Other values (9) | 145624 | 0.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 15903627 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 4055994 | |
| D | 3919631 | |
| E | 2314298 | |
| R | 2119919 | |
| A | 1978264 | |
| N | 337495 | 2.1% |
| I | 320207 | 2.0% |
| T | 304380 | 1.9% |
| C | 216358 | 1.4% |
| O | 191457 | 1.2% |
| Other values (9) | 145624 | 0.9% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 15903627 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| S | 4055994 | |
| D | 3919631 | |
| E | 2314298 | |
| R | 2119919 | |
| A | 1978264 | |
| N | 337495 | 2.1% |
| I | 320207 | 2.0% |
| T | 304380 | 1.9% |
| C | 216358 | 1.4% |
| O | 191457 | 1.2% |
| Other values (9) | 145624 | 0.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 15903627 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| S | 4055994 | |
| D | 3919631 | |
| E | 2314298 | |
| R | 2119919 | |
| A | 1978264 | |
| N | 337495 | 2.1% |
| I | 320207 | 2.0% |
| T | 304380 | 1.9% |
| C | 216358 | 1.4% |
| O | 191457 | 1.2% |
| Other values (9) | 145624 | 0.9% |
City
Text
MISSING 
| Distinct | 180 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 119922 |
| Missing (%) | 5.6% |
| Memory size | 16.5 MiB |
Length
| Max length | 22 |
|---|---|
| Median length | 8 |
| Mean length | 8.1729683 |
| Min length | 2 |
Characters and Unicode
| Total characters | 16642697 |
|---|---|
| Distinct characters | 56 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 105 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | BROOKLYN |
|---|---|
| 2nd row | BROOKLYN |
| 3rd row | NEW YORK |
| 4th row | QUEENS |
| 5th row | BRONX |
| Value | Count | Frequency (%) |
| brooklyn | 638131 | |
| new | 411014 | |
| york | 410573 | |
| bronx | 387336 | |
| island | 103251 | 3.7% |
| staten | 84322 | 3.0% |
| jamaica | 51129 | 1.8% |
| astoria | 37460 | 1.3% |
| park | 34631 | 1.2% |
| queens | 32410 | 1.2% |
| Other values (168) | 603227 |
Most occurring characters
| Value | Count | Frequency (%) |
| O | 2501348 | |
| N | 1902608 | |
| R | 1731548 | |
| K | 1126019 | 6.8% |
| Y | 1099478 | 6.6% |
| B | 1056666 | 6.3% |
| L | 996022 | 6.0% |
| E | 910922 | 5.5% |
| 757174 | 4.5% | |
| A | 750632 | 4.5% |
| Other values (46) | 3810280 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 15884142 | |
| Space Separator | 757174 | 4.5% |
| Lowercase Letter | 1375 | < 0.1% |
| Other Punctuation | 6 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| O | 2501348 | |
| N | 1902608 | |
| R | 1731548 | |
| K | 1126019 | 7.1% |
| Y | 1099478 | 6.9% |
| B | 1056666 | 6.7% |
| L | 996022 | 6.3% |
| E | 910922 | 5.7% |
| A | 750632 | 4.7% |
| S | 555863 | 3.5% |
| Other values (16) | 3253036 |
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 154 | |
| e | 142 | |
| a | 134 | |
| o | 133 | |
| r | 106 | 7.7% |
| t | 93 | 6.8% |
| l | 90 | 6.5% |
| s | 88 | 6.4% |
| i | 67 | 4.9% |
| u | 49 | 3.6% |
| Other values (16) | 319 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 4 | |
| @ | 1 | 16.7% |
| , | 1 | 16.7% |
Space Separator
| Value | Count | Frequency (%) |
| 757174 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 15885517 | |
| Common | 757180 | 4.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| O | 2501348 | |
| N | 1902608 | |
| R | 1731548 | |
| K | 1126019 | 7.1% |
| Y | 1099478 | 6.9% |
| B | 1056666 | 6.7% |
| L | 996022 | 6.3% |
| E | 910922 | 5.7% |
| A | 750632 | 4.7% |
| S | 555863 | 3.5% |
| Other values (42) | 3254411 |
Common
| Value | Count | Frequency (%) |
| 757174 | ||
| . | 4 | < 0.1% |
| @ | 1 | < 0.1% |
| , | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 16642697 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| O | 2501348 | |
| N | 1902608 | |
| R | 1731548 | |
| K | 1126019 | 6.8% |
| Y | 1099478 | 6.6% |
| B | 1056666 | 6.3% |
| L | 996022 | 6.0% |
| E | 910922 | 5.5% |
| 757174 | 4.5% | |
| A | 750632 | 4.5% |
| Other values (46) | 3810280 |
Landmark
Text
MISSING 
| Distinct | 8558 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 797021 |
| Missing (%) | 37.0% |
| Memory size | 16.5 MiB |
Length
| Max length | 32 |
|---|---|
| Median length | 29 |
| Mean length | 13.295101 |
| Min length | 6 |
Characters and Unicode
| Total characters | 18070848 |
|---|---|
| Distinct characters | 43 |
| Distinct categories | 7 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 982 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | EAST 230 STREET |
|---|---|
| 2nd row | BAY 50 STREET |
| 3rd row | EAST 29 STREET |
| 4th row | WEST 136 STREET |
| 5th row | SHERIDAN AVENUE |
| Value | Count | Frequency (%) |
| street | 614817 | |
| avenue | 500291 | 16.4% |
| east | 125020 | 4.1% |
| west | 119083 | 3.9% |
| boulevard | 48071 | 1.6% |
| road | 44809 | 1.5% |
| place | 44343 | 1.5% |
| park | 21807 | 0.7% |
| broadway | 20333 | 0.7% |
| parkway | 15318 | 0.5% |
| Other values (4914) | 1501798 |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 3173073 | |
| 2111863 | ||
| T | 1836318 | |
| A | 1338445 | 7.4% |
| R | 1270032 | 7.0% |
| S | 1213497 | 6.7% |
| N | 1024492 | 5.7% |
| U | 701265 | 3.9% |
| V | 617959 | 3.4% |
| O | 581272 | 3.2% |
| Other values (33) | 4202632 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 14672867 | |
| Space Separator | 2111863 | 11.7% |
| Decimal Number | 1285741 | 7.1% |
| Other Punctuation | 270 | < 0.1% |
| Dash Punctuation | 103 | < 0.1% |
| Open Punctuation | 2 | < 0.1% |
| Close Punctuation | 2 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 3173073 | |
| T | 1836318 | |
| A | 1338445 | |
| R | 1270032 | |
| S | 1213497 | 8.3% |
| N | 1024492 | 7.0% |
| U | 701265 | 4.8% |
| V | 617959 | 4.2% |
| O | 581272 | 4.0% |
| L | 411194 | 2.8% |
| Other values (16) | 2505320 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 293816 | |
| 2 | 155272 | |
| 3 | 131223 | |
| 4 | 115902 | 9.0% |
| 7 | 113619 | 8.8% |
| 5 | 113503 | 8.8% |
| 6 | 98485 | 7.7% |
| 8 | 96532 | 7.5% |
| 9 | 89336 | 6.9% |
| 0 | 78053 | 6.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 136 | |
| & | 125 | |
| / | 9 | 3.3% |
Space Separator
| Value | Count | Frequency (%) |
| 2111863 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 103 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 2 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 14672867 | |
| Common | 3397981 | 18.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| E | 3173073 | |
| T | 1836318 | |
| A | 1338445 | |
| R | 1270032 | |
| S | 1213497 | 8.3% |
| N | 1024492 | 7.0% |
| U | 701265 | 4.8% |
| V | 617959 | 4.2% |
| O | 581272 | 4.0% |
| L | 411194 | 2.8% |
| Other values (16) | 2505320 |
Common
| Value | Count | Frequency (%) |
| 2111863 | ||
| 1 | 293816 | 8.6% |
| 2 | 155272 | 4.6% |
| 3 | 131223 | 3.9% |
| 4 | 115902 | 3.4% |
| 7 | 113619 | 3.3% |
| 5 | 113503 | 3.3% |
| 6 | 98485 | 2.9% |
| 8 | 96532 | 2.8% |
| 9 | 89336 | 2.6% |
| Other values (7) | 78430 | 2.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 18070848 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| E | 3173073 | |
| 2111863 | ||
| T | 1836318 | |
| A | 1338445 | 7.4% |
| R | 1270032 | 7.0% |
| S | 1213497 | 6.7% |
| N | 1024492 | 5.7% |
| U | 701265 | 3.9% |
| V | 617959 | 3.4% |
| O | 581272 | 3.2% |
| Other values (33) | 4202632 |
Facility Type
Categorical
HIGH CORRELATION  IMBALANCE  MISSING 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 2018905 |
| Missing (%) | 93.6% |
| Memory size | 16.5 MiB |
| N/A | |
|---|---|
| DSNY Garage | 11167 |
Length
| Max length | 11 |
|---|---|
| Median length | 3 |
| Mean length | 3.6505349 |
| Min length | 3 |
Characters and Unicode
| Total characters | 501317 |
|---|---|
| Distinct characters | 12 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | DSNY Garage |
|---|---|
| 2nd row | N/A |
| 3rd row | N/A |
| 4th row | N/A |
| 5th row | N/A |
Common Values
| Value | Count | Frequency (%) |
| N/A | 126160 | 5.9% |
| DSNY Garage | 11167 | 0.5% |
| (Missing) | 2018905 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| n/a | 126160 | |
| dsny | 11167 | 7.5% |
| garage | 11167 | 7.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| N | 137327 | |
| / | 126160 | |
| A | 126160 | |
| a | 22334 | 4.5% |
| D | 11167 | 2.2% |
| S | 11167 | 2.2% |
| Y | 11167 | 2.2% |
| 11167 | 2.2% | |
| G | 11167 | 2.2% |
| r | 11167 | 2.2% |
| Other values (2) | 22334 | 4.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 308155 | |
| Other Punctuation | 126160 | |
| Lowercase Letter | 55835 | 11.1% |
| Space Separator | 11167 | 2.2% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 137327 | |
| A | 126160 | |
| D | 11167 | 3.6% |
| S | 11167 | 3.6% |
| Y | 11167 | 3.6% |
| G | 11167 | 3.6% |
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 22334 | |
| r | 11167 | |
| g | 11167 | |
| e | 11167 |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 126160 |
Space Separator
| Value | Count | Frequency (%) |
| 11167 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 363990 | |
| Common | 137327 | 27.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| N | 137327 | |
| A | 126160 | |
| a | 22334 | 6.1% |
| D | 11167 | 3.1% |
| S | 11167 | 3.1% |
| Y | 11167 | 3.1% |
| G | 11167 | 3.1% |
| r | 11167 | 3.1% |
| g | 11167 | 3.1% |
| e | 11167 | 3.1% |
Common
| Value | Count | Frequency (%) |
| / | 126160 | |
| 11167 | 8.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 501317 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| N | 137327 | |
| / | 126160 | |
| A | 126160 | |
| a | 22334 | 4.5% |
| D | 11167 | 2.2% |
| S | 11167 | 2.2% |
| Y | 11167 | 2.2% |
| 11167 | 2.2% | |
| G | 11167 | 2.2% |
| r | 11167 | 2.2% |
| Other values (2) | 22334 | 4.5% |
Status
Categorical
IMBALANCE 
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 16.5 MiB |
| Closed | |
|---|---|
| In Progress | 98976 |
| Open | 52491 |
| Assigned | 9036 |
| Pending | 2574 |
| Other values (2) | 2501 |
Length
| Max length | 11 |
|---|---|
| Median length | 6 |
| Mean length | 6.193783 |
| Min length | 4 |
Characters and Unicode
| Total characters | 13355233 |
|---|---|
| Distinct characters | 22 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Open |
|---|---|
| 2nd row | Open |
| 3rd row | Open |
| 4th row | Open |
| 5th row | In Progress |
Common Values
| Value | Count | Frequency (%) |
| Closed | 1990654 | |
| In Progress | 98976 | 4.6% |
| Open | 52491 | 2.4% |
| Assigned | 9036 | 0.4% |
| Pending | 2574 | 0.1% |
| Started | 1302 | 0.1% |
| Unspecified | 1199 | 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| closed | 1990654 | |
| in | 98976 | 4.4% |
| progress | 98976 | 4.4% |
| open | 52491 | 2.3% |
| assigned | 9036 | 0.4% |
| pending | 2574 | 0.1% |
| started | 1302 | 0.1% |
| unspecified | 1199 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| s | 2207877 | |
| e | 2157431 | |
| o | 2089630 | |
| d | 2004765 | |
| C | 1990654 | |
| l | 1990654 | |
| r | 199254 | 1.5% |
| n | 166850 | 1.2% |
| g | 110586 | 0.8% |
| P | 101550 | 0.8% |
| Other values (12) | 335982 | 2.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 11001049 | |
| Uppercase Letter | 2255208 | 16.9% |
| Space Separator | 98976 | 0.7% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| s | 2207877 | |
| e | 2157431 | |
| o | 2089630 | |
| d | 2004765 | |
| l | 1990654 | |
| r | 199254 | 1.8% |
| n | 166850 | 1.5% |
| g | 110586 | 1.0% |
| p | 53690 | 0.5% |
| i | 14008 | 0.1% |
| Other values (4) | 6304 | 0.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 1990654 | |
| P | 101550 | 4.5% |
| I | 98976 | 4.4% |
| O | 52491 | 2.3% |
| A | 9036 | 0.4% |
| S | 1302 | 0.1% |
| U | 1199 | 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 98976 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 13256257 | |
| Common | 98976 | 0.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| s | 2207877 | |
| e | 2157431 | |
| o | 2089630 | |
| d | 2004765 | |
| C | 1990654 | |
| l | 1990654 | |
| r | 199254 | 1.5% |
| n | 166850 | 1.3% |
| g | 110586 | 0.8% |
| P | 101550 | 0.8% |
| Other values (11) | 237006 | 1.8% |
Common
| Value | Count | Frequency (%) |
| 98976 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 13355233 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| s | 2207877 | |
| e | 2157431 | |
| o | 2089630 | |
| d | 2004765 | |
| C | 1990654 | |
| l | 1990654 | |
| r | 199254 | 1.5% |
| n | 166850 | 1.2% |
| g | 110586 | 0.8% |
| P | 101550 | 0.8% |
| Other values (12) | 335982 | 2.5% |
Due Date
Date
MISSING 
| Distinct | 5925 |
|---|---|
| Distinct (%) | 73.1% |
| Missing | 2148125 |
| Missing (%) | 99.6% |
| Memory size | 16.5 MiB |
| Minimum | 2023-03-21 09:12:47 |
|---|---|
| Maximum | 2024-01-24 02:22:23 |
MISSING 
| Distinct | 739 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 55803 |
| Missing (%) | 2.6% |
| Memory size | 16.5 MiB |
Length
| Max length | 930 |
|---|---|
| Median length | 575 |
| Mean length | 151.18653 |
| Min length | 3 |
Characters and Unicode
| Total characters | 317556582 |
|---|---|
| Distinct characters | 81 |
| Distinct categories | 10 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 72 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | If the abandoned vehicle meets the criteria to be classified as a derelict (i.e. junk) the Department of Sanitation (DSNY) will investigate and tag the vehicle within three business days. |
|---|---|
| 2nd row | If the abandoned vehicle meets the criteria to be classified as a derelict (i.e. junk) the Department of Sanitation (DSNY) will investigate and tag the vehicle within three business days. |
| 3rd row | If the abandoned vehicle meets the criteria to be classified as a derelict (i.e. junk) the Department of Sanitation (DSNY) will investigate and tag the vehicle within three business days. |
| 4th row | The Department of Transportation referred this complaint to the appropriate Maintenance Unit for repair. |
| 5th row | The Department of Homeless Services has sent a mobile outreach response team to the location. |
| Value | Count | Frequency (%) |
| the | 6080023 | 12.6% |
| to | 1914555 | 4.0% |
| department | 1847205 | 3.8% |
| and | 1813274 | 3.7% |
| complaint | 1708232 | 3.5% |
| of | 1574285 | 3.3% |
| police | 1113417 | 2.3% |
| a | 853708 | 1.8% |
| responded | 829288 | 1.7% |
| condition | 799290 | 1.7% |
| Other values (1438) | 29876348 |
Most occurring characters
| Value | Count | Frequency (%) |
| 46457318 | ||
| e | 32633328 | 10.3% |
| t | 27579074 | 8.7% |
| o | 24713650 | 7.8% |
| n | 21608051 | 6.8% |
| i | 21200334 | 6.7% |
| a | 18432903 | 5.8% |
| r | 13072333 | 4.1% |
| s | 12323006 | 3.9% |
| d | 10423607 | 3.3% |
| Other values (71) | 89112978 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 251776115 | |
| Space Separator | 46516312 | 14.6% |
| Uppercase Letter | 12078569 | 3.8% |
| Other Punctuation | 4856274 | 1.5% |
| Decimal Number | 1316524 | 0.4% |
| Dash Punctuation | 301582 | 0.1% |
| Control | 266146 | 0.1% |
| Close Punctuation | 198068 | 0.1% |
| Open Punctuation | 198068 | 0.1% |
| Connector Punctuation | 48924 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 32633328 | |
| t | 27579074 | |
| o | 24713650 | |
| n | 21608051 | 8.6% |
| i | 21200334 | 8.4% |
| a | 18432903 | 7.3% |
| r | 13072333 | 5.2% |
| s | 12323006 | 4.9% |
| d | 10423607 | 4.1% |
| l | 10256341 | 4.1% |
| Other values (17) | 59533488 |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 2570812 | |
| D | 2466716 | |
| P | 1859224 | |
| N | 758773 | 6.3% |
| C | 703609 | 5.8% |
| Y | 594723 | 4.9% |
| H | 579893 | 4.8% |
| S | 424763 | 3.5% |
| I | 375312 | 3.1% |
| E | 266671 | 2.2% |
| Other values (15) | 1478073 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 397571 | |
| 2 | 243750 | |
| 3 | 208471 | |
| 9 | 129719 | 9.9% |
| 6 | 126849 | 9.6% |
| 5 | 80757 | 6.1% |
| 7 | 69910 | 5.3% |
| 0 | 38921 | 3.0% |
| 8 | 11130 | 0.8% |
| 4 | 9446 | 0.7% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 3734744 | |
| / | 454528 | 9.4% |
| , | 454314 | 9.4% |
| ' | 109239 | 2.2% |
| : | 52234 | 1.1% |
| " | 47707 | 1.0% |
| ; | 1718 | < 0.1% |
| & | 1441 | < 0.1% |
| @ | 349 | < 0.1% |
Control
| Value | Count | Frequency (%) |
| | 133073 | |
| | 54691 | |
| | 39191 | 14.7% |
| | 39191 | 14.7% |
Space Separator
| Value | Count | Frequency (%) |
| 46457318 | ||
| 58994 | 0.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 301582 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 198068 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 198068 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 48924 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 263854684 | |
| Common | 53701898 | 16.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 32633328 | |
| t | 27579074 | 10.5% |
| o | 24713650 | 9.4% |
| n | 21608051 | 8.2% |
| i | 21200334 | 8.0% |
| a | 18432903 | 7.0% |
| r | 13072333 | 5.0% |
| s | 12323006 | 4.7% |
| d | 10423607 | 4.0% |
| l | 10256341 | 3.9% |
| Other values (42) | 71612057 |
Common
| Value | Count | Frequency (%) |
| 46457318 | ||
| . | 3734744 | 7.0% |
| / | 454528 | 0.8% |
| , | 454314 | 0.8% |
| 1 | 397571 | 0.7% |
| - | 301582 | 0.6% |
| 2 | 243750 | 0.5% |
| 3 | 208471 | 0.4% |
| ) | 198068 | 0.4% |
| ( | 198068 | 0.4% |
| Other values (19) | 1053484 | 2.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 317039375 | |
| None | 517207 | 0.2% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 46457318 | ||
| e | 32633328 | 10.3% |
| t | 27579074 | 8.7% |
| o | 24713650 | 7.8% |
| n | 21608051 | 6.8% |
| i | 21200334 | 6.7% |
| a | 18432903 | 5.8% |
| r | 13072333 | 4.1% |
| s | 12323006 | 3.9% |
| d | 10423607 | 3.3% |
| Other values (64) | 88595771 |
None
| Value | Count | Frequency (%) |
| | 133073 | |
| â | 133073 | |
| 58994 | ||
| Â | 58994 | |
| | 54691 | |
| | 39191 | 7.6% |
| | 39191 | 7.6% |
Resolution Action Updated Date
Date
MISSING 
| Distinct | 1321541 |
|---|---|
| Distinct (%) | 62.8% |
| Missing | 51376 |
| Missing (%) | 2.4% |
| Memory size | 16.5 MiB |
| Minimum | 2022-08-22 11:44:18 |
|---|---|
| Maximum | 2023-11-08 22:00:00 |
Community Board
Text
| Distinct | 77 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 16.5 MiB |
Length
| Max length | 25 |
|---|---|
| Median length | 21 |
| Mean length | 10.489153 |
| Min length | 8 |
Characters and Unicode
| Total characters | 22617048 |
|---|---|
| Distinct characters | 37 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 16 BROOKLYN |
|---|---|
| 2nd row | 15 BROOKLYN |
| 3rd row | 12 MANHATTAN |
| 4th row | 01 QUEENS |
| 5th row | Unspecified BROOKLYN |
| Value | Count | Frequency (%) |
| brooklyn | 667183 | |
| queens | 533769 | |
| manhattan | 456653 | 10.4% |
| bronx | 405022 | 9.2% |
| 12 | 194052 | 4.4% |
| 05 | 189609 | 4.3% |
| 01 | 185523 | 4.2% |
| 03 | 177685 | 4.0% |
| 07 | 171715 | 3.9% |
| 04 | 153259 | 3.5% |
| Other values (28) | 1266674 |
Most occurring characters
| Value | Count | Frequency (%) |
| N | 2696640 | 11.9% |
| 2244912 | 9.9% | |
| O | 1739388 | 7.7% |
| 0 | 1552738 | 6.9% |
| A | 1547319 | 6.8% |
| E | 1156218 | 5.1% |
| T | 1090666 | 4.8% |
| B | 1072205 | 4.7% |
| R | 1072205 | 4.7% |
| 1 | 1008760 | 4.5% |
| Other values (27) | 7435997 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 15774283 | |
| Decimal Number | 4247273 | 18.8% |
| Space Separator | 2244912 | 9.9% |
| Lowercase Letter | 350580 | 1.6% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 2696640 | |
| O | 1739388 | |
| A | 1547319 | |
| E | 1156218 | 7.3% |
| T | 1090666 | 6.9% |
| B | 1072205 | 6.8% |
| R | 1072205 | 6.8% |
| L | 755863 | 4.8% |
| S | 711129 | 4.5% |
| K | 667183 | 4.2% |
| Other values (8) | 3265467 |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 1552738 | |
| 1 | 1008760 | |
| 2 | 344944 | 8.1% |
| 3 | 236534 | 5.6% |
| 5 | 228309 | 5.4% |
| 4 | 223379 | 5.3% |
| 7 | 210078 | 4.9% |
| 8 | 175231 | 4.1% |
| 9 | 134300 | 3.2% |
| 6 | 133000 | 3.1% |
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 70116 | |
| i | 70116 | |
| n | 35058 | |
| s | 35058 | |
| p | 35058 | |
| c | 35058 | |
| f | 35058 | |
| d | 35058 |
Space Separator
| Value | Count | Frequency (%) |
| 2244912 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 16124863 | |
| Common | 6492185 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| N | 2696640 | |
| O | 1739388 | |
| A | 1547319 | |
| E | 1156218 | 7.2% |
| T | 1090666 | 6.8% |
| B | 1072205 | 6.6% |
| R | 1072205 | 6.6% |
| L | 755863 | 4.7% |
| S | 711129 | 4.4% |
| K | 667183 | 4.1% |
| Other values (16) | 3616047 |
Common
| Value | Count | Frequency (%) |
| 2244912 | ||
| 0 | 1552738 | |
| 1 | 1008760 | |
| 2 | 344944 | 5.3% |
| 3 | 236534 | 3.6% |
| 5 | 228309 | 3.5% |
| 4 | 223379 | 3.4% |
| 7 | 210078 | 3.2% |
| 8 | 175231 | 2.7% |
| 9 | 134300 | 2.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 22617048 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| N | 2696640 | 11.9% |
| 2244912 | 9.9% | |
| O | 1739388 | 7.7% |
| 0 | 1552738 | 6.9% |
| A | 1547319 | 6.8% |
| E | 1156218 | 5.1% |
| T | 1090666 | 4.8% |
| B | 1072205 | 4.7% |
| R | 1072205 | 4.7% |
| 1 | 1008760 | 4.5% |
| Other values (27) | 7435997 |
BBL
Real number (ℝ)
HIGH CORRELATION  MISSING 
| Distinct | 339263 |
|---|---|
| Distinct (%) | 17.9% |
| Missing | 261151 |
| Missing (%) | 12.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.7445135 × 109 |
| Minimum | 0 |
|---|---|
| Maximum | 5.2700005 × 109 |
| Zeros | 367 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 16.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1.0079 × 109 |
| Q1 | 2.02744 × 109 |
| median | 3.01971 × 109 |
| Q3 | 4.0089301 × 109 |
| 95-th percentile | 4.15572 × 109 |
| Maximum | 5.2700005 × 109 |
| Range | 5.2700005 × 109 |
| Interquartile range (IQR) | 1.98149 × 109 |
Descriptive statistics
| Standard deviation | 1.1775907 × 109 |
|---|---|
| Coefficient of variation (CV) | 0.42907083 |
| Kurtosis | -1.0316237 |
| Mean | 2.7445135 × 109 |
| Median Absolute Deviation (MAD) | 9.9175 × 108 |
| Skewness | -0.077409369 |
| Sum | 5.2010754 × 1015 |
| Variance | 1.3867199 × 1018 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 4068290001 | 8168 | 0.4% |
| 4068580001 | 3830 | 0.2% |
| 4114340007 | 3781 | 0.2% |
| 1011110001 | 3021 | 0.1% |
| 2025110068 | 2811 | 0.1% |
| 3003370027 | 2583 | 0.1% |
| 2048330028 | 2305 | 0.1% |
| 3044520423 | 2277 | 0.1% |
| 2029020036 | 2157 | 0.1% |
| 4142600001 | 2118 | 0.1% |
| Other values (339253) | 1862030 | |
| (Missing) | 261151 | 12.1% |
| Value | Count | Frequency (%) |
| 0 | 367 | |
| 1000010010 | 12 | < 0.1% |
| 1000010101 | 1 | < 0.1% |
| 1000010111 | 1 | < 0.1% |
| 1000010201 | 2 | < 0.1% |
| 1000020001 | 19 | < 0.1% |
| 1000020002 | 40 | < 0.1% |
| 1000020023 | 7 | < 0.1% |
| 1000030001 | 141 | < 0.1% |
| 1000030010 | 13 | < 0.1% |
| Value | Count | Frequency (%) |
| 5270000519 | 2 | < 0.1% |
| 5270000511 | 2 | < 0.1% |
| 5270000501 | 9 | |
| 5240009999 | 1 | < 0.1% |
| 5200479999 | 1 | < 0.1% |
| 5200429999 | 5 | |
| 5200399999 | 1 | < 0.1% |
| 5200389999 | 2 | < 0.1% |
| 5200379999 | 3 | < 0.1% |
| 5200169999 | 1 | < 0.1% |
Borough
Categorical
HIGH CORRELATION 
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 16.5 MiB |
| BROOKLYN | |
|---|---|
| QUEENS | |
| MANHATTAN | |
| BRONX | |
| STATEN ISLAND |
Length
| Max length | 13 |
|---|---|
| Median length | 11 |
| Mean length | 7.3674043 |
| Min length | 5 |
Characters and Unicode
| Total characters | 15885833 |
|---|---|
| Distinct characters | 27 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | BROOKLYN |
|---|---|
| 2nd row | BROOKLYN |
| 3rd row | MANHATTAN |
| 4th row | QUEENS |
| 5th row | BROOKLYN |
Common Values
| Value | Count | Frequency (%) |
| BROOKLYN | 667183 | |
| QUEENS | 533762 | |
| MANHATTAN | 457593 | |
| BRONX | 404089 | |
| STATEN ISLAND | 88680 | 4.1% |
| Unspecified | 4925 | 0.2% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| brooklyn | 667183 | |
| queens | 533762 | |
| manhattan | 457593 | |
| bronx | 404089 | |
| staten | 88680 | 4.0% |
| island | 88680 | 4.0% |
| unspecified | 4925 | 0.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| N | 2697580 | |
| O | 1738455 | |
| A | 1550139 | |
| E | 1156204 | 7.3% |
| T | 1092546 | 6.9% |
| B | 1071272 | 6.7% |
| R | 1071272 | 6.7% |
| L | 755863 | 4.8% |
| S | 711122 | 4.5% |
| K | 667183 | 4.2% |
| Other values (17) | 3374197 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 15747903 | |
| Space Separator | 88680 | 0.6% |
| Lowercase Letter | 49250 | 0.3% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 2697580 | |
| O | 1738455 | |
| A | 1550139 | |
| E | 1156204 | 7.3% |
| T | 1092546 | 6.9% |
| B | 1071272 | 6.8% |
| R | 1071272 | 6.8% |
| L | 755863 | 4.8% |
| S | 711122 | 4.5% |
| K | 667183 | 4.2% |
| Other values (8) | 3236267 |
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 9850 | |
| i | 9850 | |
| n | 4925 | |
| s | 4925 | |
| p | 4925 | |
| c | 4925 | |
| f | 4925 | |
| d | 4925 |
Space Separator
| Value | Count | Frequency (%) |
| 88680 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 15797153 | |
| Common | 88680 | 0.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| N | 2697580 | |
| O | 1738455 | |
| A | 1550139 | |
| E | 1156204 | 7.3% |
| T | 1092546 | 6.9% |
| B | 1071272 | 6.8% |
| R | 1071272 | 6.8% |
| L | 755863 | 4.8% |
| S | 711122 | 4.5% |
| K | 667183 | 4.2% |
| Other values (16) | 3285517 |
Common
| Value | Count | Frequency (%) |
| 88680 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 15885833 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| N | 2697580 | |
| O | 1738455 | |
| A | 1550139 | |
| E | 1156204 | 7.3% |
| T | 1092546 | 6.9% |
| B | 1071272 | 6.7% |
| R | 1071272 | 6.7% |
| L | 755863 | 4.8% |
| S | 711122 | 4.5% |
| K | 667183 | 4.2% |
| Other values (17) | 3374197 |
X Coordinate (State Plane)
Real number (ℝ)
HIGH CORRELATION  MISSING 
| Distinct | 108621 |
|---|---|
| Distinct (%) | 5.1% |
| Missing | 33541 |
| Missing (%) | 1.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1005322.7 |
| Minimum | 913353 |
|---|---|
| Maximum | 1067281 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 16.5 MiB |
Quantile statistics
| Minimum | 913353 |
|---|---|
| 5-th percentile | 978282 |
| Q1 | 992506 |
| median | 1004135 |
| Q3 | 1018124 |
| 95-th percentile | 1042283 |
| Maximum | 1067281 |
| Range | 153928 |
| Interquartile range (IQR) | 25618 |
Descriptive statistics
| Standard deviation | 21603.603 |
|---|---|
| Coefficient of variation (CV) | 0.021489223 |
| Kurtosis | 1.5962722 |
| Mean | 1005322.7 |
| Median Absolute Deviation (MAD) | 12660 |
| Skewness | -0.32947563 |
| Sum | 2.1339894 × 1012 |
| Variance | 4.6671566 × 108 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1037000 | 8195 | 0.4% |
| 1038740 | 3860 | 0.2% |
| 994623 | 2806 | 0.1% |
| 1003936 | 2648 | 0.1% |
| 984166 | 2610 | 0.1% |
| 1022911 | 2320 | 0.1% |
| 1019422 | 2296 | 0.1% |
| 1026729 | 2274 | 0.1% |
| 1000482 | 2024 | 0.1% |
| 1043001 | 1973 | 0.1% |
| Other values (108611) | 2091685 | |
| (Missing) | 33541 | 1.6% |
| Value | Count | Frequency (%) |
| 913353 | 1 | < 0.1% |
| 913412 | 1 | < 0.1% |
| 913414 | 1 | < 0.1% |
| 913432 | 2 | |
| 913444 | 1 | < 0.1% |
| 913459 | 3 | |
| 913512 | 1 | < 0.1% |
| 913554 | 1 | < 0.1% |
| 913628 | 1 | < 0.1% |
| 913683 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 1067281 | 4 | |
| 1067279 | 2 | < 0.1% |
| 1067220 | 2 | < 0.1% |
| 1067180 | 2 | < 0.1% |
| 1067178 | 6 | |
| 1067176 | 2 | < 0.1% |
| 1067173 | 2 | < 0.1% |
| 1067133 | 1 | < 0.1% |
| 1067132 | 1 | < 0.1% |
| 1067131 | 2 | < 0.1% |
Y Coordinate (State Plane)
Real number (ℝ)
HIGH CORRELATION  MISSING 
| Distinct | 121969 |
|---|---|
| Distinct (%) | 5.7% |
| Missing | 32919 |
| Missing (%) | 1.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 206249.2 |
| Minimum | 121098 |
|---|---|
| Maximum | 271876 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 16.5 MiB |
Quantile statistics
| Minimum | 121098 |
|---|---|
| 5-th percentile | 156949 |
| Q1 | 184165 |
| median | 203853 |
| Q3 | 233032 |
| 95-th percentile | 255604 |
| Maximum | 271876 |
| Range | 150778 |
| Interquartile range (IQR) | 48867 |
Descriptive statistics
| Standard deviation | 30846.404 |
|---|---|
| Coefficient of variation (CV) | 0.1495589 |
| Kurtosis | -0.8260381 |
| Mean | 206249.2 |
| Median Absolute Deviation (MAD) | 22655 |
| Skewness | 0.023847716 |
| Sum | 4.379316 × 1011 |
| Variance | 9.5150064 × 108 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 202363 | 8212 | 0.4% |
| 200651 | 3879 | 0.2% |
| 222956 | 2704 | 0.1% |
| 188346 | 2618 | 0.1% |
| 242233 | 2608 | 0.1% |
| 264242 | 2306 | 0.1% |
| 177846 | 2294 | 0.1% |
| 182858 | 2291 | 0.1% |
| 237497 | 1992 | 0.1% |
| 175548 | 1973 | 0.1% |
| Other values (121959) | 2092436 | |
| (Missing) | 32919 | 1.5% |
| Value | Count | Frequency (%) |
| 121098 | 1 | < 0.1% |
| 121140 | 1 | < 0.1% |
| 121152 | 1 | < 0.1% |
| 121189 | 2 | < 0.1% |
| 121213 | 1 | < 0.1% |
| 121268 | 1 | < 0.1% |
| 121280 | 3 | < 0.1% |
| 121305 | 5 | < 0.1% |
| 121315 | 1 | < 0.1% |
| 121316 | 15 |
| Value | Count | Frequency (%) |
| 271876 | 23 | |
| 271861 | 1 | < 0.1% |
| 271730 | 4 | < 0.1% |
| 271676 | 17 | |
| 271672 | 1 | < 0.1% |
| 271664 | 6 | < 0.1% |
| 271660 | 3 | < 0.1% |
| 271640 | 1 | < 0.1% |
| 271639 | 3 | < 0.1% |
| 271629 | 1 | < 0.1% |
Open Data Channel Type
Categorical
HIGH CORRELATION 
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 16.5 MiB |
| ONLINE | |
|---|---|
| PHONE | |
| MOBILE | |
| UNKNOWN | |
| OTHER | 151 |
Length
| Max length | 7 |
|---|---|
| Median length | 6 |
| Mean length | 5.7704324 |
| Min length | 5 |
Characters and Unicode
| Total characters | 12442391 |
|---|---|
| Distinct characters | 14 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | PHONE |
|---|---|
| 2nd row | PHONE |
| 3rd row | PHONE |
| 4th row | UNKNOWN |
| 5th row | MOBILE |
Common Values
| Value | Count | Frequency (%) |
| ONLINE | 905705 | |
| PHONE | 662826 | |
| MOBILE | 419574 | |
| UNKNOWN | 167976 | 7.8% |
| OTHER | 151 | < 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| online | 905705 | |
| phone | 662826 | |
| mobile | 419574 | |
| unknown | 167976 | 7.8% |
| other | 151 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| N | 2978164 | |
| O | 2156232 | |
| E | 1988256 | |
| L | 1325279 | |
| I | 1325279 | |
| H | 662977 | 5.3% |
| P | 662826 | 5.3% |
| M | 419574 | 3.4% |
| B | 419574 | 3.4% |
| U | 167976 | 1.4% |
| Other values (4) | 336254 | 2.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 12442391 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 2978164 | |
| O | 2156232 | |
| E | 1988256 | |
| L | 1325279 | |
| I | 1325279 | |
| H | 662977 | 5.3% |
| P | 662826 | 5.3% |
| M | 419574 | 3.4% |
| B | 419574 | 3.4% |
| U | 167976 | 1.4% |
| Other values (4) | 336254 | 2.7% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 12442391 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| N | 2978164 | |
| O | 2156232 | |
| E | 1988256 | |
| L | 1325279 | |
| I | 1325279 | |
| H | 662977 | 5.3% |
| P | 662826 | 5.3% |
| M | 419574 | 3.4% |
| B | 419574 | 3.4% |
| U | 167976 | 1.4% |
| Other values (4) | 336254 | 2.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 12442391 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| N | 2978164 | |
| O | 2156232 | |
| E | 1988256 | |
| L | 1325279 | |
| I | 1325279 | |
| H | 662977 | 5.3% |
| P | 662826 | 5.3% |
| M | 419574 | 3.4% |
| B | 419574 | 3.4% |
| U | 167976 | 1.4% |
| Other values (4) | 336254 | 2.7% |
| Distinct | 1295 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 16.5 MiB |
Length
| Max length | 57 |
|---|---|
| Median length | 11 |
| Mean length | 11.03361 |
| Min length | 3 |
Characters and Unicode
| Total characters | 23791022 |
|---|---|
| Distinct characters | 78 |
| Distinct categories | 10 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 460 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Unspecified |
|---|---|
| 2nd row | Unspecified |
| 3rd row | Unspecified |
| 4th row | Unspecified |
| 5th row | Unspecified |
| Value | Count | Frequency (%) |
| unspecified | 2141704 | |
| park | 8297 | 0.4% |
| playground | 2304 | 0.1% |
| n/a | 2141 | 0.1% |
| central | 1442 | 0.1% |
| corona | 735 | < 0.1% |
| meadows | 726 | < 0.1% |
| flushing | 726 | < 0.1% |
| leif | 616 | < 0.1% |
| ericson | 616 | < 0.1% |
| Other values (1616) | 18774 | 0.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 4297371 | |
| i | 4291777 | |
| n | 2156365 | |
| s | 2148692 | |
| d | 2148165 | |
| c | 2145076 | |
| f | 2142835 | |
| p | 2142806 | |
| U | 2141960 | |
| a | 23094 | 0.1% |
| Other values (68) | 152881 | 0.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 21583291 | |
| Uppercase Letter | 2180442 | 9.2% |
| Space Separator | 21849 | 0.1% |
| Other Punctuation | 3970 | < 0.1% |
| Decimal Number | 1054 | < 0.1% |
| Open Punctuation | 161 | < 0.1% |
| Close Punctuation | 157 | < 0.1% |
| Dash Punctuation | 89 | < 0.1% |
| Control | 6 | < 0.1% |
| Format | 3 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 4297371 | |
| i | 4291777 | |
| n | 2156365 | |
| s | 2148692 | |
| d | 2148165 | |
| c | 2145076 | |
| f | 2142835 | |
| p | 2142806 | |
| a | 23094 | 0.1% |
| r | 22403 | 0.1% |
| Other values (17) | 64707 | 0.3% |
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 2141960 | |
| P | 12032 | 0.6% |
| C | 3777 | 0.2% |
| A | 2864 | 0.1% |
| S | 2470 | 0.1% |
| N | 2415 | 0.1% |
| M | 2121 | 0.1% |
| F | 1484 | 0.1% |
| B | 1458 | 0.1% |
| H | 1346 | 0.1% |
| Other values (17) | 8515 | 0.4% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 226 | |
| 0 | 178 | |
| 2 | 122 | |
| 4 | 94 | |
| 9 | 93 | |
| 5 | 82 | 7.8% |
| 6 | 79 | 7.5% |
| 3 | 66 | 6.3% |
| 8 | 62 | 5.9% |
| 7 | 52 | 4.9% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 2203 | |
| . | 1292 | |
| ' | 401 | 10.1% |
| " | 30 | 0.8% |
| & | 22 | 0.6% |
| , | 17 | 0.4% |
| % | 5 | 0.1% |
Control
| Value | Count | Frequency (%) |
| | 3 | |
| | 3 |
Space Separator
| Value | Count | Frequency (%) |
| 21849 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 161 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 157 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 89 |
Format
| Value | Count | Frequency (%) |
| | 3 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 23763733 | |
| Common | 27289 | 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 4297371 | |
| i | 4291777 | |
| n | 2156365 | |
| s | 2148692 | |
| d | 2148165 | |
| c | 2145076 | |
| f | 2142835 | |
| p | 2142806 | |
| U | 2141960 | |
| a | 23094 | 0.1% |
| Other values (44) | 125592 | 0.5% |
Common
| Value | Count | Frequency (%) |
| 21849 | ||
| / | 2203 | 8.1% |
| . | 1292 | 4.7% |
| ' | 401 | 1.5% |
| 1 | 226 | 0.8% |
| 0 | 178 | 0.7% |
| ( | 161 | 0.6% |
| ) | 157 | 0.6% |
| 2 | 122 | 0.4% |
| 4 | 94 | 0.3% |
| Other values (14) | 606 | 2.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 23791007 | |
| None | 15 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 4297371 | |
| i | 4291777 | |
| n | 2156365 | |
| s | 2148692 | |
| d | 2148165 | |
| c | 2145076 | |
| f | 2142835 | |
| p | 2142806 | |
| U | 2141960 | |
| a | 23094 | 0.1% |
| Other values (63) | 152866 | 0.6% |
None
| Value | Count | Frequency (%) |
| Ã | 3 | |
| | 3 | |
| â | 3 | |
| | 3 | |
| | 3 |
Park Borough
Categorical
HIGH CORRELATION 
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 16.5 MiB |
| BROOKLYN | |
|---|---|
| QUEENS | |
| MANHATTAN | |
| BRONX | |
| STATEN ISLAND |
Length
| Max length | 13 |
|---|---|
| Median length | 11 |
| Mean length | 7.3674043 |
| Min length | 5 |
Characters and Unicode
| Total characters | 15885833 |
|---|---|
| Distinct characters | 27 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | BROOKLYN |
|---|---|
| 2nd row | BROOKLYN |
| 3rd row | MANHATTAN |
| 4th row | QUEENS |
| 5th row | BROOKLYN |
Common Values
| Value | Count | Frequency (%) |
| BROOKLYN | 667183 | |
| QUEENS | 533762 | |
| MANHATTAN | 457593 | |
| BRONX | 404089 | |
| STATEN ISLAND | 88680 | 4.1% |
| Unspecified | 4925 | 0.2% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| brooklyn | 667183 | |
| queens | 533762 | |
| manhattan | 457593 | |
| bronx | 404089 | |
| staten | 88680 | 4.0% |
| island | 88680 | 4.0% |
| unspecified | 4925 | 0.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| N | 2697580 | |
| O | 1738455 | |
| A | 1550139 | |
| E | 1156204 | 7.3% |
| T | 1092546 | 6.9% |
| B | 1071272 | 6.7% |
| R | 1071272 | 6.7% |
| L | 755863 | 4.8% |
| S | 711122 | 4.5% |
| K | 667183 | 4.2% |
| Other values (17) | 3374197 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 15747903 | |
| Space Separator | 88680 | 0.6% |
| Lowercase Letter | 49250 | 0.3% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 2697580 | |
| O | 1738455 | |
| A | 1550139 | |
| E | 1156204 | 7.3% |
| T | 1092546 | 6.9% |
| B | 1071272 | 6.8% |
| R | 1071272 | 6.8% |
| L | 755863 | 4.8% |
| S | 711122 | 4.5% |
| K | 667183 | 4.2% |
| Other values (8) | 3236267 |
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 9850 | |
| i | 9850 | |
| n | 4925 | |
| s | 4925 | |
| p | 4925 | |
| c | 4925 | |
| f | 4925 | |
| d | 4925 |
Space Separator
| Value | Count | Frequency (%) |
| 88680 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 15797153 | |
| Common | 88680 | 0.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| N | 2697580 | |
| O | 1738455 | |
| A | 1550139 | |
| E | 1156204 | 7.3% |
| T | 1092546 | 6.9% |
| B | 1071272 | 6.8% |
| R | 1071272 | 6.8% |
| L | 755863 | 4.8% |
| S | 711122 | 4.5% |
| K | 667183 | 4.2% |
| Other values (16) | 3285517 |
Common
| Value | Count | Frequency (%) |
| 88680 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 15885833 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| N | 2697580 | |
| O | 1738455 | |
| A | 1550139 | |
| E | 1156204 | 7.3% |
| T | 1092546 | 6.9% |
| B | 1071272 | 6.7% |
| R | 1071272 | 6.7% |
| L | 755863 | 4.8% |
| S | 711122 | 4.5% |
| K | 667183 | 4.2% |
| Other values (17) | 3374197 |
Vehicle Type
Categorical
HIGH CORRELATION  IMBALANCE  MISSING 
| Distinct | 4 |
|---|---|
| Distinct (%) | 0.8% |
| Missing | 2155736 |
| Missing (%) | > 99.9% |
| Memory size | 16.5 MiB |
| Car Service | |
|---|---|
| Ambulette / Paratransit | 16 |
| Commuter Van | 14 |
| Green Taxi | 2 |
Length
| Max length | 23 |
|---|---|
| Median length | 11 |
| Mean length | 11.41129 |
| Min length | 10 |
Characters and Unicode
| Total characters | 5660 |
|---|---|
| Distinct characters | 24 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Car Service |
|---|---|
| 2nd row | Car Service |
| 3rd row | Car Service |
| 4th row | Car Service |
| 5th row | Ambulette / Paratransit |
Common Values
| Value | Count | Frequency (%) |
| Car Service | 464 | < 0.1% |
| Ambulette / Paratransit | 16 | < 0.1% |
| Commuter Van | 14 | < 0.1% |
| Green Taxi | 2 | < 0.1% |
| (Missing) | 2155736 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| car | 464 | |
| service | 464 | |
| ambulette | 16 | 1.6% |
| 16 | 1.6% | |
| paratransit | 16 | 1.6% |
| commuter | 14 | 1.4% |
| van | 14 | 1.4% |
| green | 2 | 0.2% |
| taxi | 2 | 0.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 978 | |
| r | 976 | |
| a | 528 | |
| 512 | ||
| i | 482 | |
| C | 478 | |
| S | 464 | |
| v | 464 | |
| c | 464 | |
| t | 78 | 1.4% |
| Other values (14) | 236 | 4.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 4140 | |
| Uppercase Letter | 992 | 17.5% |
| Space Separator | 512 | 9.0% |
| Other Punctuation | 16 | 0.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 978 | |
| r | 976 | |
| a | 528 | |
| i | 482 | |
| v | 464 | |
| c | 464 | |
| t | 78 | 1.9% |
| m | 44 | 1.1% |
| n | 32 | 0.8% |
| u | 30 | 0.7% |
| Other values (5) | 64 | 1.5% |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 478 | |
| S | 464 | |
| P | 16 | 1.6% |
| A | 16 | 1.6% |
| V | 14 | 1.4% |
| G | 2 | 0.2% |
| T | 2 | 0.2% |
Space Separator
| Value | Count | Frequency (%) |
| 512 |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 16 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 5132 | |
| Common | 528 | 9.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 978 | |
| r | 976 | |
| a | 528 | |
| i | 482 | |
| C | 478 | |
| S | 464 | |
| v | 464 | |
| c | 464 | |
| t | 78 | 1.5% |
| m | 44 | 0.9% |
| Other values (12) | 176 | 3.4% |
Common
| Value | Count | Frequency (%) |
| 512 | ||
| / | 16 | 3.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5660 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 978 | |
| r | 976 | |
| a | 528 | |
| 512 | ||
| i | 482 | |
| C | 478 | |
| S | 464 | |
| v | 464 | |
| c | 464 | |
| t | 78 | 1.4% |
| Other values (14) | 236 | 4.2% |
Taxi Company Borough
Categorical
HIGH CORRELATION  MISSING 
| Distinct | 5 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 2155022 |
| Missing (%) | 99.9% |
| Memory size | 16.5 MiB |
| MANHATTAN | |
|---|---|
| QUEENS | |
| BROOKLYN | |
| BRONX | |
| STATEN ISLAND | 29 |
Length
| Max length | 13 |
|---|---|
| Median length | 9 |
| Mean length | 7.438843 |
| Min length | 5 |
Characters and Unicode
| Total characters | 9001 |
|---|---|
| Distinct characters | 19 |
| Distinct categories | 2 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | MANHATTAN |
|---|---|
| 2nd row | QUEENS |
| 3rd row | QUEENS |
| 4th row | QUEENS |
| 5th row | BROOKLYN |
Common Values
| Value | Count | Frequency (%) |
| MANHATTAN | 390 | < 0.1% |
| QUEENS | 295 | < 0.1% |
| BROOKLYN | 288 | < 0.1% |
| BRONX | 208 | < 0.1% |
| STATEN ISLAND | 29 | < 0.1% |
| (Missing) | 2155022 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| manhattan | 390 | |
| queens | 295 | |
| brooklyn | 288 | |
| bronx | 208 | |
| staten | 29 | 2.3% |
| island | 29 | 2.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| N | 1629 | |
| A | 1228 | |
| T | 838 | |
| O | 784 | |
| E | 619 | 6.9% |
| B | 496 | 5.5% |
| R | 496 | 5.5% |
| M | 390 | 4.3% |
| H | 390 | 4.3% |
| S | 353 | 3.9% |
| Other values (9) | 1778 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 8972 | |
| Space Separator | 29 | 0.3% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 1629 | |
| A | 1228 | |
| T | 838 | |
| O | 784 | |
| E | 619 | 6.9% |
| B | 496 | 5.5% |
| R | 496 | 5.5% |
| M | 390 | 4.3% |
| H | 390 | 4.3% |
| S | 353 | 3.9% |
| Other values (8) | 1749 |
Space Separator
| Value | Count | Frequency (%) |
| 29 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 8972 | |
| Common | 29 | 0.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| N | 1629 | |
| A | 1228 | |
| T | 838 | |
| O | 784 | |
| E | 619 | 6.9% |
| B | 496 | 5.5% |
| R | 496 | 5.5% |
| M | 390 | 4.3% |
| H | 390 | 4.3% |
| S | 353 | 3.9% |
| Other values (8) | 1749 |
Common
| Value | Count | Frequency (%) |
| 29 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 9001 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| N | 1629 | |
| A | 1228 | |
| T | 838 | |
| O | 784 | |
| E | 619 | 6.9% |
| B | 496 | 5.5% |
| R | 496 | 5.5% |
| M | 390 | 4.3% |
| H | 390 | 4.3% |
| S | 353 | 3.9% |
| Other values (9) | 1778 |
MISSING 
| Distinct | 12614 |
|---|---|
| Distinct (%) | 53.0% |
| Missing | 2132427 |
| Missing (%) | 98.9% |
| Memory size | 16.5 MiB |
Length
| Max length | 60 |
|---|---|
| Median length | 57 |
| Mean length | 48.655325 |
| Min length | 11 |
Characters and Unicode
| Total characters | 1158240 |
|---|---|
| Distinct characters | 65 |
| Distinct categories | 8 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 9196 ? |
|---|---|
| Unique (%) | 38.6% |
Sample
| 1st row | 125 COLUMBUS AVENUE, MANHATTAN (NEW YORK), NY, 10023 |
|---|---|
| 2nd row | 215 EAST 68 STREET, MANHATTAN (NEW YORK), NY, 10065 |
| 3rd row | 641 8 AVENUE, MANHATTAN (NEW YORK), NY, 10036 |
| 4th row | 37 BLUE SLIP, BROOKLYN, NY, 11222 |
| 5th row | 315 WEST 20 STREET, MANHATTAN (NEW YORK), NY, 10011 |
| Value | Count | Frequency (%) |
| ny | 23670 | 12.5% |
| manhattan | 15126 | 8.0% |
| york | 12461 | 6.6% |
| new | 12362 | 6.5% |
| street | 11085 | 5.8% |
| avenue | 9547 | 5.0% |
| east | 4964 | 2.6% |
| queens | 4489 | 2.4% |
| west | 3753 | 2.0% |
| brooklyn | 3490 | 1.8% |
| Other values (4057) | 88917 |
Most occurring characters
| Value | Count | Frequency (%) |
| 179912 | ||
| N | 101842 | 8.8% |
| A | 86904 | 7.5% |
| E | 85518 | 7.4% |
| T | 71981 | 6.2% |
| , | 71080 | 6.1% |
| 1 | 56576 | 4.9% |
| 0 | 45470 | 3.9% |
| R | 44589 | 3.8% |
| Y | 44373 | 3.8% |
| Other values (55) | 369995 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 681662 | |
| Decimal Number | 190259 | 16.4% |
| Space Separator | 179912 | 15.5% |
| Other Punctuation | 71085 | 6.1% |
| Open Punctuation | 16604 | 1.4% |
| Close Punctuation | 16602 | 1.4% |
| Dash Punctuation | 1266 | 0.1% |
| Lowercase Letter | 850 | 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 101842 | |
| A | 86904 | |
| E | 85518 | |
| T | 71981 | |
| R | 44589 | 6.5% |
| Y | 44373 | 6.5% |
| O | 33625 | 4.9% |
| S | 31566 | 4.6% |
| H | 21313 | 3.1% |
| M | 20787 | 3.0% |
| Other values (16) | 139164 |
Lowercase Letter
| Value | Count | Frequency (%) |
| r | 191 | |
| i | 120 | |
| t | 116 | |
| o | 88 | |
| p | 80 | |
| a | 77 | |
| e | 39 | 4.6% |
| d | 33 | 3.9% |
| u | 23 | 2.7% |
| n | 20 | 2.4% |
| Other values (11) | 63 | 7.4% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 56576 | |
| 0 | 45470 | |
| 2 | 19282 | 10.1% |
| 3 | 16213 | 8.5% |
| 4 | 11117 | 5.8% |
| 5 | 9900 | 5.2% |
| 6 | 9644 | 5.1% |
| 7 | 7783 | 4.1% |
| 9 | 7307 | 3.8% |
| 8 | 6967 | 3.7% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 71080 | |
| / | 2 | < 0.1% |
| ' | 2 | < 0.1% |
| & | 1 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 179912 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 16604 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 16602 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1266 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 682512 | |
| Common | 475728 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| N | 101842 | |
| A | 86904 | |
| E | 85518 | |
| T | 71981 | |
| R | 44589 | 6.5% |
| Y | 44373 | 6.5% |
| O | 33625 | 4.9% |
| S | 31566 | 4.6% |
| H | 21313 | 3.1% |
| M | 20787 | 3.0% |
| Other values (37) | 140014 |
Common
| Value | Count | Frequency (%) |
| 179912 | ||
| , | 71080 | 14.9% |
| 1 | 56576 | 11.9% |
| 0 | 45470 | 9.6% |
| 2 | 19282 | 4.1% |
| ( | 16604 | 3.5% |
| ) | 16602 | 3.5% |
| 3 | 16213 | 3.4% |
| 4 | 11117 | 2.3% |
| 5 | 9900 | 2.1% |
| Other values (8) | 32972 | 6.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1158240 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 179912 | ||
| N | 101842 | 8.8% |
| A | 86904 | 7.5% |
| E | 85518 | 7.4% |
| T | 71981 | 6.2% |
| , | 71080 | 6.1% |
| 1 | 56576 | 4.9% |
| 0 | 45470 | 3.9% |
| R | 44589 | 3.8% |
| Y | 44373 | 3.8% |
| Other values (55) | 369995 |
MISSING 
| Distinct | 77 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 2140247 |
| Missing (%) | 99.3% |
| Memory size | 16.5 MiB |
Length
| Max length | 42 |
|---|---|
| Median length | 1 |
| Mean length | 4.2508602 |
| Min length | 1 |
Characters and Unicode
| Total characters | 67950 |
|---|---|
| Distinct characters | 62 |
| Distinct categories | 6 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 5 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 4 |
|---|---|
| 2nd row | E |
| 3rd row | 7 |
| 4th row | 6 |
| 5th row | 2 |
| Value | Count | Frequency (%) |
| 1 | 1586 | 7.2% |
| r | 1547 | 7.0% |
| expwy | 1497 | 6.8% |
| f | 1062 | 4.8% |
| pkwy | 1010 | 4.6% |
| 6 | 981 | 4.5% |
| e | 933 | 4.2% |
| 2 | 928 | 4.2% |
| a | 772 | 3.5% |
| q | 642 | 2.9% |
| Other values (112) | 11021 |
Most occurring characters
| Value | Count | Frequency (%) |
| 5994 | 8.8% | |
| n | 3667 | 5.4% |
| r | 3216 | 4.7% |
| y | 3176 | 4.7% |
| w | 3170 | 4.7% |
| E | 2794 | 4.1% |
| a | 2732 | 4.0% |
| e | 2657 | 3.9% |
| R | 2404 | 3.5% |
| o | 2219 | 3.3% |
| Other values (52) | 35921 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 36641 | |
| Uppercase Letter | 19189 | |
| Space Separator | 5994 | 8.8% |
| Decimal Number | 5188 | 7.6% |
| Other Punctuation | 843 | 1.2% |
| Dash Punctuation | 95 | 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 3667 | 10.0% |
| r | 3216 | 8.8% |
| y | 3176 | 8.7% |
| w | 3170 | 8.7% |
| a | 2732 | 7.5% |
| e | 2657 | 7.3% |
| o | 2219 | 6.1% |
| t | 2085 | 5.7% |
| s | 2049 | 5.6% |
| k | 1758 | 4.8% |
| Other values (14) | 9912 |
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 2794 | |
| R | 2404 | |
| B | 1540 | 8.0% |
| F | 1427 | 7.4% |
| P | 1305 | 6.8% |
| D | 1269 | 6.6% |
| C | 1158 | 6.0% |
| Q | 1081 | 5.6% |
| A | 987 | 5.1% |
| G | 959 | 5.0% |
| Other values (13) | 4265 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 1620 | |
| 6 | 981 | |
| 2 | 973 | |
| 4 | 614 | 11.8% |
| 3 | 296 | 5.7% |
| 7 | 261 | 5.0% |
| 9 | 235 | 4.5% |
| 5 | 198 | 3.8% |
| 8 | 7 | 0.1% |
| 0 | 3 | 0.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 766 | |
| . | 59 | 7.0% |
| , | 18 | 2.1% |
Space Separator
| Value | Count | Frequency (%) |
| 5994 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 95 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 55830 | |
| Common | 12120 | 17.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| n | 3667 | 6.6% |
| r | 3216 | 5.8% |
| y | 3176 | 5.7% |
| w | 3170 | 5.7% |
| E | 2794 | 5.0% |
| a | 2732 | 4.9% |
| e | 2657 | 4.8% |
| R | 2404 | 4.3% |
| o | 2219 | 4.0% |
| t | 2085 | 3.7% |
| Other values (37) | 27710 |
Common
| Value | Count | Frequency (%) |
| 5994 | ||
| 1 | 1620 | 13.4% |
| 6 | 981 | 8.1% |
| 2 | 973 | 8.0% |
| / | 766 | 6.3% |
| 4 | 614 | 5.1% |
| 3 | 296 | 2.4% |
| 7 | 261 | 2.2% |
| 9 | 235 | 1.9% |
| 5 | 198 | 1.6% |
| Other values (5) | 182 | 1.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 67950 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 5994 | 8.8% | |
| n | 3667 | 5.4% |
| r | 3216 | 4.7% |
| y | 3176 | 4.7% |
| w | 3170 | 4.7% |
| E | 2794 | 4.1% |
| a | 2732 | 4.0% |
| e | 2657 | 3.9% |
| R | 2404 | 3.5% |
| o | 2219 | 3.3% |
| Other values (52) | 35921 |
MISSING 
| Distinct | 244 |
|---|---|
| Distinct (%) | 2.9% |
| Missing | 2147873 |
| Missing (%) | 99.6% |
| Memory size | 16.5 MiB |
Length
| Max length | 54 |
|---|---|
| Median length | 36 |
| Mean length | 19.154684 |
| Min length | 8 |
Characters and Unicode
| Total characters | 160114 |
|---|---|
| Distinct characters | 64 |
| Distinct categories | 8 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 24 ? |
|---|---|
| Unique (%) | 0.3% |
Sample
| 1st row | 6 Uptown & The Bronx |
|---|---|
| 2nd row | 2 3 Downtown & Brooklyn |
| 3rd row | L to Brooklyn |
| 4th row | B D F M Uptown & The Bronx - Queens |
| 5th row | M R to Manhattan |
| Value | Count | Frequency (%) |
| 2769 | 9.1% | |
| bound | 2206 | 7.2% |
| downtown | 1592 | 5.2% |
| to | 1481 | 4.9% |
| uptown | 1440 | 4.7% |
| brooklyn | 1167 | 3.8% |
| bronx | 1093 | 3.6% |
| the | 1078 | 3.5% |
| 1 | 1067 | 3.5% |
| r | 904 | 3.0% |
| Other values (145) | 15724 |
Most occurring characters
| Value | Count | Frequency (%) |
| 22162 | 13.8% | |
| o | 18323 | 11.4% |
| n | 15578 | 9.7% |
| t | 11715 | 7.3% |
| r | 6227 | 3.9% |
| B | 6123 | 3.8% |
| a | 5925 | 3.7% |
| w | 5762 | 3.6% |
| e | 5570 | 3.5% |
| u | 5377 | 3.4% |
| Other values (54) | 57352 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 103045 | |
| Uppercase Letter | 25296 | 15.8% |
| Space Separator | 22162 | 13.8% |
| Other Punctuation | 5131 | 3.2% |
| Decimal Number | 3816 | 2.4% |
| Dash Punctuation | 474 | 0.3% |
| Open Punctuation | 95 | 0.1% |
| Close Punctuation | 95 | 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 18323 | |
| n | 15578 | |
| t | 11715 | |
| r | 6227 | 6.0% |
| a | 5925 | 5.7% |
| w | 5762 | 5.6% |
| e | 5570 | 5.4% |
| u | 5377 | 5.2% |
| d | 4412 | 4.3% |
| h | 4218 | 4.1% |
| Other values (14) | 19938 |
Uppercase Letter
| Value | Count | Frequency (%) |
| B | 6123 | |
| D | 2103 | 8.3% |
| T | 1765 | 7.0% |
| U | 1608 | 6.4% |
| W | 1455 | 5.8% |
| E | 1398 | 5.5% |
| M | 1238 | 4.9% |
| R | 1207 | 4.8% |
| N | 1200 | 4.7% |
| S | 1131 | 4.5% |
| Other values (14) | 6068 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 1083 | |
| 2 | 608 | |
| 6 | 562 | |
| 3 | 558 | |
| 5 | 447 | |
| 4 | 358 | 9.4% |
| 7 | 76 | 2.0% |
| 9 | 51 | 1.3% |
| 8 | 47 | 1.2% |
| 0 | 26 | 0.7% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 2832 | |
| & | 2299 |
Space Separator
| Value | Count | Frequency (%) |
| 22162 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 474 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 95 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 95 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 128341 | |
| Common | 31773 | 19.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 18323 | |
| n | 15578 | 12.1% |
| t | 11715 | 9.1% |
| r | 6227 | 4.9% |
| B | 6123 | 4.8% |
| a | 5925 | 4.6% |
| w | 5762 | 4.5% |
| e | 5570 | 4.3% |
| u | 5377 | 4.2% |
| d | 4412 | 3.4% |
| Other values (38) | 43329 |
Common
| Value | Count | Frequency (%) |
| 22162 | ||
| / | 2832 | 8.9% |
| & | 2299 | 7.2% |
| 1 | 1083 | 3.4% |
| 2 | 608 | 1.9% |
| 6 | 562 | 1.8% |
| 3 | 558 | 1.8% |
| - | 474 | 1.5% |
| 5 | 447 | 1.4% |
| 4 | 358 | 1.1% |
| Other values (6) | 390 | 1.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 160114 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 22162 | 13.8% | |
| o | 18323 | 11.4% |
| n | 15578 | 9.7% |
| t | 11715 | 7.3% |
| r | 6227 | 3.9% |
| B | 6123 | 3.8% |
| a | 5925 | 3.7% |
| w | 5762 | 3.6% |
| e | 5570 | 3.5% |
| u | 5377 | 3.4% |
| Other values (54) | 57352 |
Road Ramp
Text
MISSING 
| Distinct | 410 |
|---|---|
| Distinct (%) | 8.5% |
| Missing | 2151397 |
| Missing (%) | 99.8% |
| Memory size | 16.5 MiB |
Length
| Max length | 40 |
|---|---|
| Median length | 39 |
| Mean length | 11.671768 |
| Min length | 3 |
Characters and Unicode
| Total characters | 56433 |
|---|---|
| Distinct characters | 67 |
| Distinct categories | 8 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 177 ? |
|---|---|
| Unique (%) | 3.7% |
Sample
| 1st row | Montrose Av & Bushwick Av |
|---|---|
| 2nd row | Roadway |
| 3rd row | Roadway |
| 4th row | Roadway |
| 5th row | Roadway |
| Value | Count | Frequency (%) |
| roadway | 2204 | |
| 1582 | 12.2% | |
| st | 1367 | 10.5% |
| ramp | 1187 | 9.1% |
| av | 995 | 7.7% |
| w | 495 | 3.8% |
| to | 274 | 2.1% |
| e | 228 | 1.8% |
| broadway | 177 | 1.4% |
| 7 | 117 | 0.9% |
| Other values (434) | 4379 |
Most occurring characters
| Value | Count | Frequency (%) |
| 8170 | ||
| a | 7041 | 12.5% |
| o | 3951 | 7.0% |
| R | 3462 | 6.1% |
| t | 2880 | 5.1% |
| d | 2727 | 4.8% |
| w | 2666 | 4.7% |
| y | 2629 | 4.7% |
| n | 1571 | 2.8% |
| S | 1571 | 2.8% |
| Other values (57) | 19765 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 34125 | |
| Uppercase Letter | 9717 | 17.2% |
| Space Separator | 8170 | 14.5% |
| Decimal Number | 2605 | 4.6% |
| Other Punctuation | 1661 | 2.9% |
| Open Punctuation | 87 | 0.2% |
| Dash Punctuation | 45 | 0.1% |
| Close Punctuation | 23 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 7041 | |
| o | 3951 | |
| t | 2880 | |
| d | 2727 | 8.0% |
| w | 2666 | 7.8% |
| y | 2629 | 7.7% |
| n | 1571 | 4.6% |
| m | 1397 | 4.1% |
| p | 1376 | 4.0% |
| r | 1303 | 3.8% |
| Other values (15) | 6584 |
Uppercase Letter
| Value | Count | Frequency (%) |
| R | 3462 | |
| S | 1571 | |
| A | 1233 | 12.7% |
| W | 853 | 8.8% |
| B | 439 | 4.5% |
| E | 346 | 3.6% |
| P | 230 | 2.4% |
| C | 180 | 1.9% |
| N | 175 | 1.8% |
| F | 171 | 1.8% |
| Other values (14) | 1057 | 10.9% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 471 | |
| 8 | 334 | |
| 7 | 318 | |
| 4 | 305 | |
| 2 | 277 | |
| 3 | 267 | |
| 5 | 212 | |
| 6 | 195 | |
| 9 | 146 | 5.6% |
| 0 | 80 | 3.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| & | 1470 | |
| / | 136 | 8.2% |
| , | 48 | 2.9% |
| . | 7 | 0.4% |
Space Separator
| Value | Count | Frequency (%) |
| 8170 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 87 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 45 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 23 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 43842 | |
| Common | 12591 | 22.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 7041 | |
| o | 3951 | 9.0% |
| R | 3462 | 7.9% |
| t | 2880 | 6.6% |
| d | 2727 | 6.2% |
| w | 2666 | 6.1% |
| y | 2629 | 6.0% |
| n | 1571 | 3.6% |
| S | 1571 | 3.6% |
| m | 1397 | 3.2% |
| Other values (39) | 13947 |
Common
| Value | Count | Frequency (%) |
| 8170 | ||
| & | 1470 | 11.7% |
| 1 | 471 | 3.7% |
| 8 | 334 | 2.7% |
| 7 | 318 | 2.5% |
| 4 | 305 | 2.4% |
| 2 | 277 | 2.2% |
| 3 | 267 | 2.1% |
| 5 | 212 | 1.7% |
| 6 | 195 | 1.5% |
| Other values (8) | 572 | 4.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 56433 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 8170 | ||
| a | 7041 | 12.5% |
| o | 3951 | 7.0% |
| R | 3462 | 6.1% |
| t | 2880 | 5.1% |
| d | 2727 | 4.8% |
| w | 2666 | 4.7% |
| y | 2629 | 4.7% |
| n | 1571 | 2.8% |
| S | 1571 | 2.8% |
| Other values (57) | 19765 |
MISSING 
| Distinct | 860 |
|---|---|
| Distinct (%) | 5.4% |
| Missing | 2140241 |
| Missing (%) | 99.3% |
| Memory size | 16.5 MiB |
Length
| Max length | 100 |
|---|---|
| Median length | 99 |
| Mean length | 16.791883 |
| Min length | 3 |
Characters and Unicode
| Total characters | 268519 |
|---|---|
| Distinct characters | 68 |
| Distinct categories | 8 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 237 ? |
|---|---|
| Unique (%) | 1.5% |
Sample
| 1st row | Mezzanine |
|---|---|
| 2nd row | Mezzanine |
| 3rd row | Stairway |
| 4th row | Platform |
| 5th row | Platform |
| Value | Count | Frequency (%) |
| platform | 5307 | 11.9% |
| exit | 4346 | 9.7% |
| mezzanine | 3578 | 8.0% |
| 2307 | 5.2% | |
| entrance | 1713 | 3.8% |
| st | 1463 | 3.3% |
| ave | 1428 | 3.2% |
| stairway | 958 | 2.1% |
| blvd | 886 | 2.0% |
| expwy | 790 | 1.8% |
| Other values (695) | 21990 |
Most occurring characters
| Value | Count | Frequency (%) |
| 28834 | 10.7% | |
| a | 19883 | 7.4% |
| t | 18803 | 7.0% |
| e | 17238 | 6.4% |
| n | 16593 | 6.2% |
| r | 13913 | 5.2% |
| i | 12377 | 4.6% |
| o | 9981 | 3.7% |
| l | 9040 | 3.4% |
| E | 8282 | 3.1% |
| Other values (58) | 113575 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 173669 | |
| Uppercase Letter | 38932 | 14.5% |
| Space Separator | 28834 | 10.7% |
| Decimal Number | 12557 | 4.7% |
| Open Punctuation | 5363 | 2.0% |
| Close Punctuation | 5294 | 2.0% |
| Dash Punctuation | 2980 | 1.1% |
| Other Punctuation | 890 | 0.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 19883 | |
| t | 18803 | |
| e | 17238 | |
| n | 16593 | 9.6% |
| r | 13913 | 8.0% |
| i | 12377 | 7.1% |
| o | 9981 | 5.7% |
| l | 9040 | 5.2% |
| z | 7281 | 4.2% |
| m | 6147 | 3.5% |
| Other values (16) | 42413 |
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 8282 | |
| P | 6979 | |
| M | 4297 | |
| S | 3057 | 7.9% |
| B | 2920 | 7.5% |
| A | 2590 | 6.7% |
| W | 1352 | 3.5% |
| R | 1095 | 2.8% |
| C | 1048 | 2.7% |
| I | 1017 | 2.6% |
| Other values (15) | 6295 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 2473 | |
| 2 | 2174 | |
| 5 | 1250 | |
| 3 | 1234 | |
| 4 | 1095 | |
| 9 | 1043 | |
| 7 | 1036 | |
| 8 | 955 | 7.6% |
| 6 | 815 | 6.5% |
| 0 | 482 | 3.8% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 730 | |
| . | 147 | 16.5% |
| , | 13 | 1.5% |
Space Separator
| Value | Count | Frequency (%) |
| 28834 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 5363 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 5294 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2980 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 212601 | |
| Common | 55918 | 20.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 19883 | 9.4% |
| t | 18803 | 8.8% |
| e | 17238 | 8.1% |
| n | 16593 | 7.8% |
| r | 13913 | 6.5% |
| i | 12377 | 5.8% |
| o | 9981 | 4.7% |
| l | 9040 | 4.3% |
| E | 8282 | 3.9% |
| z | 7281 | 3.4% |
| Other values (41) | 79210 |
Common
| Value | Count | Frequency (%) |
| 28834 | ||
| ( | 5363 | 9.6% |
| ) | 5294 | 9.5% |
| - | 2980 | 5.3% |
| 1 | 2473 | 4.4% |
| 2 | 2174 | 3.9% |
| 5 | 1250 | 2.2% |
| 3 | 1234 | 2.2% |
| 4 | 1095 | 2.0% |
| 9 | 1043 | 1.9% |
| Other values (7) | 4178 | 7.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 268519 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 28834 | 10.7% | |
| a | 19883 | 7.4% |
| t | 18803 | 7.0% |
| e | 17238 | 6.4% |
| n | 16593 | 6.2% |
| r | 13913 | 5.2% |
| i | 12377 | 4.6% |
| o | 9981 | 3.7% |
| l | 9040 | 3.4% |
| E | 8282 | 3.1% |
| Other values (58) | 113575 |
Latitude
Real number (ℝ)
HIGH CORRELATION  MISSING 
| Distinct | 464593 |
|---|---|
| Distinct (%) | 21.9% |
| Missing | 33616 |
| Missing (%) | 1.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 40.732729 |
| Minimum | 40.498807 |
|---|---|
| Maximum | 40.912869 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 16.5 MiB |
Quantile statistics
| Minimum | 40.498807 |
|---|---|
| 5-th percentile | 40.59741 |
| Q1 | 40.672162 |
| median | 40.726149 |
| Q3 | 40.806275 |
| 95-th percentile | 40.868183 |
| Maximum | 40.912869 |
| Range | 0.41406209 |
| Interquartile range (IQR) | 0.1341138 |
Descriptive statistics
| Standard deviation | 0.084668591 |
|---|---|
| Coefficient of variation (CV) | 0.0020786378 |
| Kurtosis | -0.82581567 |
| Mean | 40.732729 |
| Median Absolute Deviation (MAD) | 0.062202949 |
| Skewness | 0.023693011 |
| Sum | 86459942 |
| Variance | 0.0071687704 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 40.72195913 | 8184 | 0.4% |
| 40.71724957 | 3845 | 0.2% |
| 40.7786329 | 2693 | 0.1% |
| 40.8315271 | 2583 | 0.1% |
| 40.68364303 | 2582 | 0.1% |
| 40.89187242 | 2305 | 0.1% |
| 40.65475302 | 2292 | 0.1% |
| 40.6684778 | 2261 | 0.1% |
| 40.81853516 | 1986 | 0.1% |
| 40.64832049 | 1970 | 0.1% |
| Other values (464583) | 2091915 | |
| (Missing) | 33616 | 1.6% |
| Value | Count | Frequency (%) |
| 40.4988067 | 1 | < 0.1% |
| 40.49892396 | 1 | < 0.1% |
| 40.49894885 | 1 | < 0.1% |
| 40.49905831 | 2 | < 0.1% |
| 40.49912214 | 1 | < 0.1% |
| 40.49927504 | 1 | < 0.1% |
| 40.49930796 | 3 | < 0.1% |
| 40.49937854 | 5 | < 0.1% |
| 40.49940396 | 1 | < 0.1% |
| 40.49940871 | 15 |
| Value | Count | Frequency (%) |
| 40.9128688 | 23 | |
| 40.91282765 | 1 | < 0.1% |
| 40.91246817 | 4 | < 0.1% |
| 40.91231946 | 17 | |
| 40.91230844 | 1 | < 0.1% |
| 40.91228642 | 6 | < 0.1% |
| 40.9122754 | 3 | < 0.1% |
| 40.91221958 | 1 | < 0.1% |
| 40.91221758 | 3 | < 0.1% |
| 40.9121894 | 1 | < 0.1% |
Longitude
Real number (ℝ)
HIGH CORRELATION  MISSING 
| Distinct | 464594 |
|---|---|
| Distinct (%) | 21.9% |
| Missing | 33616 |
| Missing (%) | 1.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -73.92393 |
| Minimum | -74.254952 |
|---|---|
| Maximum | -73.700376 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 2122616 |
| Negative (%) | 98.4% |
| Memory size | 16.5 MiB |
Quantile statistics
| Minimum | -74.254952 |
|---|---|
| 5-th percentile | -74.021496 |
| Q1 | -73.970228 |
| median | -73.928211 |
| Q3 | -73.87769 |
| 95-th percentile | -73.79064 |
| Maximum | -73.700376 |
| Range | 0.5545754 |
| Interquartile range (IQR) | 0.092538509 |
Descriptive statistics
| Standard deviation | 0.077910036 |
|---|---|
| Coefficient of variation (CV) | -0.0010539217 |
| Kurtosis | 1.5823323 |
| Mean | -73.92393 |
| Median Absolute Deviation (MAD) | 0.045738213 |
| Skewness | -0.32727814 |
| Sum | -1.5691212 × 108 |
| Variance | 0.0060699737 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| -73.80969682 | 8184 | 0.4% |
| -73.80343341 | 3845 | 0.2% |
| -73.9625461 | 2693 | 0.1% |
| -73.92886303 | 2583 | 0.1% |
| -74.00030287 | 2582 | 0.1% |
| -73.86016845 | 2305 | 0.1% |
| -73.87323986 | 2292 | 0.1% |
| -73.84687385 | 2261 | 0.1% |
| -73.9413558 | 1986 | 0.1% |
| -73.78828125 | 1970 | 0.1% |
| Other values (464584) | 2091915 | |
| (Missing) | 33616 | 1.6% |
| Value | Count | Frequency (%) |
| -74.25495172 | 1 | < 0.1% |
| -74.25473797 | 1 | < 0.1% |
| -74.25473091 | 1 | < 0.1% |
| -74.2546657 | 2 | |
| -74.25462445 | 1 | < 0.1% |
| -74.2545726 | 3 | |
| -74.25437535 | 1 | < 0.1% |
| -74.25422295 | 1 | < 0.1% |
| -74.25394246 | 1 | < 0.1% |
| -74.25376196 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| -73.70037632 | 4 | |
| -73.70038355 | 2 | < 0.1% |
| -73.70059685 | 2 | < 0.1% |
| -73.70073668 | 2 | < 0.1% |
| -73.70074293 | 6 | |
| -73.70075063 | 2 | < 0.1% |
| -73.70076037 | 1 | < 0.1% |
| -73.7007611 | 1 | < 0.1% |
| -73.70090285 | 1 | < 0.1% |
| -73.70090631 | 2 | < 0.1% |
Location
Text
MISSING 
| Distinct | 464601 |
|---|---|
| Distinct (%) | 21.9% |
| Missing | 33616 |
| Missing (%) | 1.6% |
| Memory size | 16.5 MiB |
Length
| Max length | 90 |
|---|---|
| Median length | 39 |
| Mean length | 39.054151 |
| Min length | 26 |
Characters and Unicode
| Total characters | 82896966 |
|---|---|
| Distinct characters | 32 |
| Distinct categories | 9 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 215292 ? |
|---|---|
| Unique (%) | 10.1% |
Sample
| 1st row | (40.65665084455846, -73.90933142555113) |
|---|---|
| 2nd row | (40.59303241455525, -73.9572059782219) |
| 3rd row | (40.85844745053305, -73.92927888866892) |
| 4th row | (40.66889502741585, -73.93294405917439) |
| 5th row | (40.892021197823865, -73.86063833365542) |
| Value | Count | Frequency (%) |
| 40.72195913199264 | 8184 | 0.2% |
| 73.80969682426189 | 8184 | 0.2% |
| 40.71724956934783 | 3845 | 0.1% |
| 73.80343340538089 | 3845 | 0.1% |
| 40.77863290127972 | 2693 | 0.1% |
| 73.96254609596876 | 2693 | 0.1% |
| 40.8315271048226 | 2583 | 0.1% |
| 73.92886303221528 | 2583 | 0.1% |
| 40.68364302931577 | 2582 | 0.1% |
| 74.00030286766025 | 2582 | 0.1% |
| Other values (929185) | 4205469 |
Most occurring characters
| Value | Count | Frequency (%) |
| 7 | 8477235 | |
| 4 | 8061955 | |
| 3 | 7416968 | |
| 0 | 7346144 | |
| 9 | 6727450 | |
| 8 | 6711940 | |
| 6 | 6473622 | |
| 5 | 5831032 | 7.0% |
| 2 | 5532653 | 6.7% |
| 1 | 5459597 | 6.6% |
| Other values (22) | 14858370 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 68038596 | |
| Other Punctuation | 6367853 | 7.7% |
| Space Separator | 2122629 | 2.6% |
| Open Punctuation | 2122617 | 2.6% |
| Dash Punctuation | 2122616 | 2.6% |
| Close Punctuation | 2122616 | 2.6% |
| Lowercase Letter | 34 | < 0.1% |
| Control | 4 | < 0.1% |
| Uppercase Letter | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| r | 8 | |
| e | 6 | |
| t | 4 | |
| s | 4 | |
| a | 3 | 8.8% |
| o | 2 | 5.9% |
| u | 2 | 5.9% |
| n | 2 | 5.9% |
| l | 1 | 2.9% |
| g | 1 | 2.9% |
Decimal Number
| Value | Count | Frequency (%) |
| 7 | 8477235 | |
| 4 | 8061955 | |
| 3 | 7416968 | |
| 0 | 7346144 | |
| 9 | 6727450 | |
| 8 | 6711940 | |
| 6 | 6473622 | |
| 5 | 5831032 | |
| 2 | 5532653 | |
| 1 | 5459597 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 4245232 | |
| , | 2122618 | |
| : | 3 | < 0.1% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 2122616 | |
| { | 1 | < 0.1% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 2122615 | |
| } | 1 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 2122629 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2122616 |
Control
| Value | Count | Frequency (%) |
| 4 |
Uppercase Letter
| Value | Count | Frequency (%) |
| I | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 82896931 | |
| Latin | 35 | < 0.1% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 7 | 8477235 | |
| 4 | 8061955 | |
| 3 | 7416968 | |
| 0 | 7346144 | |
| 9 | 6727450 | |
| 8 | 6711940 | |
| 6 | 6473622 | |
| 5 | 5831032 | 7.0% |
| 2 | 5532653 | 6.7% |
| 1 | 5459597 | 6.6% |
| Other values (10) | 14858335 |
Latin
| Value | Count | Frequency (%) |
| r | 8 | |
| e | 6 | |
| t | 4 | |
| s | 4 | |
| a | 3 | 8.6% |
| o | 2 | 5.7% |
| u | 2 | 5.7% |
| n | 2 | 5.7% |
| I | 1 | 2.9% |
| l | 1 | 2.9% |
| Other values (2) | 2 | 5.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 82896966 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 7 | 8477235 | |
| 4 | 8061955 | |
| 3 | 7416968 | |
| 0 | 7346144 | |
| 9 | 6727450 | |
| 8 | 6711940 | |
| 6 | 6473622 | |
| 5 | 5831032 | 7.0% |
| 2 | 5532653 | 6.7% |
| 1 | 5459597 | 6.6% |
| Other values (22) | 14858370 |
Zip Codes
Real number (ℝ)
HIGH CORRELATION  MISSING 
| Distinct | 219 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 42923 |
| Missing (%) | 2.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 14643.099 |
| Minimum | 10090 |
|---|---|
| Maximum | 26001 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 16.5 MiB |
Quantile statistics
| Minimum | 10090 |
|---|---|
| 5-th percentile | 10700 |
| Q1 | 11723 |
| median | 13516 |
| Q3 | 17213 |
| 95-th percentile | 24016 |
| Maximum | 26001 |
| Range | 15911 |
| Interquartile range (IQR) | 5490 |
Descriptive statistics
| Standard deviation | 3623.2429 |
|---|---|
| Coefficient of variation (CV) | 0.24743689 |
| Kurtosis | 1.0124105 |
| Mean | 14643.099 |
| Median Absolute Deviation (MAD) | 2241 |
| Skewness | 1.2031337 |
| Sum | 3.0945393 × 1010 |
| Variance | 13127889 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 17215 | 32340 | 1.5% |
| 13510 | 31767 | 1.5% |
| 15310 | 29850 | 1.4% |
| 16865 | 29373 | 1.4% |
| 11605 | 28885 | 1.3% |
| 10930 | 28482 | 1.3% |
| 11606 | 28198 | 1.3% |
| 17613 | 28124 | 1.3% |
| 10935 | 27728 | 1.3% |
| 10934 | 27033 | 1.3% |
| Other values (209) | 1821529 | |
| (Missing) | 42923 | 2.0% |
| Value | Count | Frequency (%) |
| 10090 | 5941 | |
| 10091 | 1058 | < 0.1% |
| 10092 | 3326 | |
| 10093 | 147 | < 0.1% |
| 10094 | 111 | < 0.1% |
| 10095 | 88 | < 0.1% |
| 10096 | 17 | < 0.1% |
| 10097 | 9 | < 0.1% |
| 10098 | 278 | < 0.1% |
| 10099 | 8282 |
| Value | Count | Frequency (%) |
| 26001 | 31 | < 0.1% |
| 25293 | 1 | < 0.1% |
| 24894 | 47 | < 0.1% |
| 24672 | 62 | < 0.1% |
| 24671 | 5639 | 0.3% |
| 24670 | 11599 | |
| 24669 | 11894 | |
| 24668 | 10629 | |
| 24340 | 15449 | |
| 24339 | 2202 | 0.1% |
Community Districts
Real number (ℝ)
MISSING 
| Distinct | 71 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 34126 |
| Missing (%) | 1.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 36.751201 |
| Minimum | 1 |
|---|---|
| Maximum | 71 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 16.5 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 5 |
| Q1 | 18 |
| median | 39 |
| Q3 | 54 |
| 95-th percentile | 69 |
| Maximum | 71 |
| Range | 70 |
| Interquartile range (IQR) | 36 |
Descriptive statistics
| Standard deviation | 20.595558 |
|---|---|
| Coefficient of variation (CV) | 0.56040502 |
| Kurtosis | -1.2188162 |
| Mean | 36.751201 |
| Median Absolute Deviation (MAD) | 18 |
| Skewness | -0.062213299 |
| Sum | 77989944 |
| Variance | 424.17699 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 47 | 64061 | 3.0% |
| 36 | 56640 | 2.6% |
| 45 | 54424 | 2.5% |
| 41 | 53205 | 2.5% |
| 54 | 52869 | 2.5% |
| 50 | 51614 | 2.4% |
| 39 | 50908 | 2.4% |
| 20 | 49686 | 2.3% |
| 18 | 48246 | 2.2% |
| 24 | 46785 | 2.2% |
| Other values (61) | 1593668 |
| Value | Count | Frequency (%) |
| 1 | 36663 | |
| 2 | 32297 | |
| 3 | 1025 | < 0.1% |
| 4 | 35128 | |
| 5 | 34029 | |
| 6 | 43464 | |
| 7 | 27612 | |
| 8 | 15076 | 0.7% |
| 9 | 28607 | |
| 10 | 29454 |
| Value | Count | Frequency (%) |
| 71 | 23283 | |
| 70 | 41348 | |
| 69 | 44968 | |
| 68 | 46556 | |
| 67 | 1885 | 0.1% |
| 66 | 30648 | |
| 65 | 33611 | |
| 64 | 2028 | 0.1% |
| 63 | 33993 | |
| 62 | 41013 |
Borough Boundaries
Categorical
HIGH CORRELATION  MISSING 
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 34131 |
| Missing (%) | 1.6% |
| Memory size | 16.5 MiB |
| 2.0 | |
|---|---|
| 3.0 | |
| 4.0 | |
| 5.0 | |
| 1.0 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 6366303 |
|---|---|
| Distinct characters | 7 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2.0 |
|---|---|
| 2nd row | 2.0 |
| 3rd row | 4.0 |
| 4th row | 2.0 |
| 5th row | 5.0 |
Common Values
| Value | Count | Frequency (%) |
| 2.0 | 659106 | |
| 3.0 | 523524 | |
| 4.0 | 454134 | |
| 5.0 | 398214 | |
| 1.0 | 87123 | 4.0% |
| (Missing) | 34131 | 1.6% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 2.0 | 659106 | |
| 3.0 | 523524 | |
| 4.0 | 454134 | |
| 5.0 | 398214 | |
| 1.0 | 87123 | 4.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 2122101 | |
| 0 | 2122101 | |
| 2 | 659106 | 10.4% |
| 3 | 523524 | 8.2% |
| 4 | 454134 | 7.1% |
| 5 | 398214 | 6.3% |
| 1 | 87123 | 1.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 4244202 | |
| Other Punctuation | 2122101 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 2122101 | |
| 2 | 659106 | 15.5% |
| 3 | 523524 | 12.3% |
| 4 | 454134 | 10.7% |
| 5 | 398214 | 9.4% |
| 1 | 87123 | 2.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 2122101 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 6366303 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| . | 2122101 | |
| 0 | 2122101 | |
| 2 | 659106 | 10.4% |
| 3 | 523524 | 8.2% |
| 4 | 454134 | 7.1% |
| 5 | 398214 | 6.3% |
| 1 | 87123 | 1.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 6366303 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 2122101 | |
| 0 | 2122101 | |
| 2 | 659106 | 10.4% |
| 3 | 523524 | 8.2% |
| 4 | 454134 | 7.1% |
| 5 | 398214 | 6.3% |
| 1 | 87123 | 1.4% |
City Council Districts
Real number (ℝ)
MISSING 
| Distinct | 51 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 34126 |
| Missing (%) | 1.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 27.761464 |
| Minimum | 1 |
|---|---|
| Maximum | 51 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 16.5 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 4 |
| Q1 | 15 |
| median | 29 |
| Q3 | 39 |
| 95-th percentile | 49 |
| Maximum | 51 |
| Range | 50 |
| Interquartile range (IQR) | 24 |
Descriptive statistics
| Standard deviation | 14.257871 |
|---|---|
| Coefficient of variation (CV) | 0.51358497 |
| Kurtosis | -1.1221083 |
| Mean | 27.761464 |
| Median Absolute Deviation (MAD) | 12 |
| Skewness | -0.16992589 |
| Sum | 58912770 |
| Variance | 203.28688 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 10 | 65746 | 3.0% |
| 36 | 62669 | 2.9% |
| 38 | 61304 | 2.8% |
| 29 | 57564 | 2.7% |
| 39 | 56422 | 2.6% |
| 22 | 55546 | 2.6% |
| 48 | 54350 | 2.5% |
| 30 | 51818 | 2.4% |
| 46 | 50545 | 2.3% |
| 35 | 49740 | 2.3% |
| Other values (41) | 1556402 |
| Value | Count | Frequency (%) |
| 1 | 22182 | 1.0% |
| 2 | 36251 | |
| 3 | 26683 | |
| 4 | 38522 | |
| 5 | 28735 | |
| 6 | 33665 | |
| 7 | 37835 | |
| 8 | 29324 | |
| 9 | 28076 | |
| 10 | 65746 |
| Value | Count | Frequency (%) |
| 51 | 42403 | |
| 50 | 45401 | |
| 49 | 45656 | |
| 48 | 54350 | |
| 47 | 29216 | |
| 46 | 50545 | |
| 45 | 36184 | |
| 44 | 37328 | |
| 43 | 49658 | |
| 42 | 46905 |
Police Precincts
Real number (ℝ)
HIGH CORRELATION  MISSING 
| Distinct | 77 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 34126 |
| Missing (%) | 1.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 41.66148 |
| Minimum | 1 |
|---|---|
| Maximum | 77 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 16.5 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 6 |
| Q1 | 26 |
| median | 41 |
| Q3 | 61 |
| 95-th percentile | 73 |
| Maximum | 77 |
| Range | 76 |
| Interquartile range (IQR) | 35 |
Descriptive statistics
| Standard deviation | 21.013409 |
|---|---|
| Coefficient of variation (CV) | 0.5043846 |
| Kurtosis | -1.0818418 |
| Mean | 41.66148 |
| Median Absolute Deviation (MAD) | 18 |
| Skewness | -0.10779936 |
| Sum | 88410077 |
| Variance | 441.56336 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 47 | 54464 | 2.5% |
| 62 | 53105 | 2.5% |
| 72 | 51744 | 2.4% |
| 27 | 51618 | 2.4% |
| 34 | 46909 | 2.2% |
| 67 | 45040 | 2.1% |
| 30 | 44117 | 2.0% |
| 29 | 43480 | 2.0% |
| 43 | 43424 | 2.0% |
| 64 | 41045 | 1.9% |
| Other values (67) | 1647160 |
| Value | Count | Frequency (%) |
| 1 | 21266 | |
| 2 | 14188 | |
| 3 | 19122 | |
| 4 | 12592 | 0.6% |
| 5 | 23465 | |
| 6 | 15585 | |
| 7 | 20817 | |
| 8 | 19072 | |
| 9 | 12639 | 0.6% |
| 10 | 32463 |
| Value | Count | Frequency (%) |
| 77 | 18695 | 0.9% |
| 76 | 24614 | |
| 75 | 20001 | 0.9% |
| 74 | 23813 | |
| 73 | 34530 | |
| 72 | 51744 | |
| 71 | 29001 | |
| 70 | 22661 | |
| 69 | 21319 | |
| 68 | 33421 |
Request Closing Time
Unsupported
MISSING  REJECTED  UNSUPPORTED 
| Missing | 163693 |
|---|---|
| Missing (%) | 7.6% |
| Memory size | 16.5 MiB |
| Unique Key | Incident Zip | BBL | X Coordinate (State Plane) | Y Coordinate (State Plane) | Latitude | Longitude | Zip Codes | Community Districts | City Council Districts | Police Precincts | Agency | Agency Name | Address Type | Facility Type | Status | Borough | Open Data Channel Type | Park Borough | Vehicle Type | Taxi Company Borough | Borough Boundaries | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Unique Key | 1.000 | 0.001 | -0.006 | -0.001 | -0.003 | -0.003 | -0.001 | 0.002 | 0.011 | 0.008 | -0.004 | 0.058 | 0.058 | 0.015 | 0.025 | 0.102 | 0.038 | 0.024 | 0.038 | 0.118 | 0.000 | 0.019 |
| Incident Zip | 0.001 | 1.000 | 0.801 | 0.533 | -0.456 | -0.456 | 0.532 | 0.714 | 0.175 | -0.034 | 0.743 | 0.024 | 0.024 | 0.019 | 1.000 | 0.003 | 0.192 | 0.000 | 0.192 | 1.000 | 1.000 | 0.001 |
| BBL | -0.006 | 0.801 | 1.000 | 0.356 | -0.576 | -0.576 | 0.355 | 0.530 | 0.047 | -0.191 | 0.917 | 0.109 | 0.109 | 0.104 | 0.233 | 0.040 | 0.893 | 0.065 | 0.893 | 0.122 | 0.856 | 1.000 |
| X Coordinate (State Plane) | -0.001 | 0.533 | 0.356 | 1.000 | 0.339 | 0.338 | 1.000 | 0.200 | 0.300 | -0.027 | 0.326 | 0.099 | 0.099 | 0.032 | 0.160 | 0.033 | 0.609 | 0.067 | 0.609 | 0.085 | 0.618 | 0.680 |
| Y Coordinate (State Plane) | -0.003 | -0.456 | -0.576 | 0.339 | 1.000 | 1.000 | 0.340 | -0.534 | 0.069 | 0.018 | -0.450 | 0.112 | 0.112 | 0.030 | 0.253 | 0.039 | 0.572 | 0.062 | 0.572 | 0.105 | 0.523 | 0.638 |
| Latitude | -0.003 | -0.456 | -0.576 | 0.338 | 1.000 | 1.000 | 0.339 | -0.534 | 0.069 | 0.019 | -0.450 | 0.112 | 0.112 | 0.030 | 0.253 | 0.039 | 0.572 | 0.062 | 0.572 | 0.105 | 0.523 | 0.638 |
| Longitude | -0.001 | 0.532 | 0.355 | 1.000 | 0.340 | 0.339 | 1.000 | 0.199 | 0.300 | -0.027 | 0.326 | 0.100 | 0.100 | 0.032 | 0.161 | 0.033 | 0.608 | 0.067 | 0.608 | 0.085 | 0.618 | 0.680 |
| Zip Codes | 0.002 | 0.714 | 0.530 | 0.200 | -0.534 | -0.534 | 0.199 | 1.000 | 0.151 | 0.141 | 0.506 | 0.103 | 0.103 | 0.030 | 0.192 | 0.034 | 0.678 | 0.060 | 0.678 | 0.000 | 0.607 | 0.758 |
| Community Districts | 0.011 | 0.175 | 0.047 | 0.300 | 0.069 | 0.069 | 0.300 | 0.151 | 1.000 | 0.204 | 0.107 | 0.097 | 0.097 | 0.021 | 0.248 | 0.030 | 0.353 | 0.061 | 0.353 | 0.111 | 0.357 | 0.395 |
| City Council Districts | 0.008 | -0.034 | -0.191 | -0.027 | 0.018 | 0.019 | -0.027 | 0.141 | 0.204 | 1.000 | -0.180 | 0.088 | 0.088 | 0.037 | 0.111 | 0.026 | 0.383 | 0.056 | 0.383 | 0.080 | 0.416 | 0.428 |
| Police Precincts | -0.004 | 0.743 | 0.917 | 0.326 | -0.450 | -0.450 | 0.326 | 0.506 | 0.107 | -0.180 | 1.000 | 0.139 | 0.139 | 0.036 | 0.265 | 0.042 | 0.747 | 0.073 | 0.747 | 0.048 | 0.681 | 0.836 |
| Agency | 0.058 | 0.024 | 0.109 | 0.099 | 0.112 | 0.112 | 0.100 | 0.103 | 0.097 | 0.088 | 0.139 | 1.000 | 1.000 | 0.270 | 0.709 | 0.285 | 0.149 | 0.453 | 0.149 | 1.000 | 1.000 | 0.165 |
| Agency Name | 0.058 | 0.024 | 0.109 | 0.099 | 0.112 | 0.112 | 0.100 | 0.103 | 0.097 | 0.088 | 0.139 | 1.000 | 1.000 | 0.270 | 0.709 | 0.285 | 0.149 | 0.453 | 0.149 | 1.000 | 1.000 | 0.165 |
| Address Type | 0.015 | 0.019 | 0.104 | 0.032 | 0.030 | 0.030 | 0.032 | 0.030 | 0.021 | 0.037 | 0.036 | 0.270 | 0.270 | 1.000 | 0.177 | 0.116 | 0.037 | 0.168 | 0.037 | 0.060 | 0.158 | 0.028 |
| Facility Type | 0.025 | 1.000 | 0.233 | 0.160 | 0.253 | 0.253 | 0.161 | 0.192 | 0.248 | 0.111 | 0.265 | 0.709 | 0.709 | 0.177 | 1.000 | 0.079 | 0.200 | 0.701 | 0.200 | 0.000 | 0.000 | 0.201 |
| Status | 0.102 | 0.003 | 0.040 | 0.033 | 0.039 | 0.039 | 0.033 | 0.034 | 0.030 | 0.026 | 0.042 | 0.285 | 0.285 | 0.116 | 0.079 | 1.000 | 0.041 | 0.140 | 0.041 | 0.000 | 0.000 | 0.045 |
| Borough | 0.038 | 0.192 | 0.893 | 0.609 | 0.572 | 0.572 | 0.608 | 0.678 | 0.353 | 0.383 | 0.747 | 0.149 | 0.149 | 0.037 | 0.200 | 0.041 | 1.000 | 0.061 | 1.000 | 0.124 | 0.856 | 0.998 |
| Open Data Channel Type | 0.024 | 0.000 | 0.065 | 0.067 | 0.062 | 0.062 | 0.067 | 0.060 | 0.061 | 0.056 | 0.073 | 0.453 | 0.453 | 0.168 | 0.701 | 0.140 | 0.061 | 1.000 | 0.061 | 0.000 | 0.153 | 0.061 |
| Park Borough | 0.038 | 0.192 | 0.893 | 0.609 | 0.572 | 0.572 | 0.608 | 0.678 | 0.353 | 0.383 | 0.747 | 0.149 | 0.149 | 0.037 | 0.200 | 0.041 | 1.000 | 0.061 | 1.000 | 0.124 | 0.856 | 0.998 |
| Vehicle Type | 0.118 | 1.000 | 0.122 | 0.085 | 0.105 | 0.105 | 0.085 | 0.000 | 0.111 | 0.080 | 0.048 | 1.000 | 1.000 | 0.060 | 0.000 | 0.000 | 0.124 | 0.000 | 0.124 | 1.000 | 0.061 | 0.120 |
| Taxi Company Borough | 0.000 | 1.000 | 0.856 | 0.618 | 0.523 | 0.523 | 0.618 | 0.607 | 0.357 | 0.416 | 0.681 | 1.000 | 1.000 | 0.158 | 0.000 | 0.000 | 0.856 | 0.153 | 0.856 | 0.061 | 1.000 | 0.862 |
| Borough Boundaries | 0.019 | 0.001 | 1.000 | 0.680 | 0.638 | 0.638 | 0.680 | 0.758 | 0.395 | 0.428 | 0.836 | 0.165 | 0.165 | 0.028 | 0.201 | 0.045 | 0.998 | 0.061 | 0.998 | 0.120 | 0.862 | 1.000 |
| Unique Key | Created Date | Closed Date | Agency | Agency Name | Complaint Type | Descriptor | Location Type | Incident Zip | Incident Address | Street Name | Cross Street 1 | Cross Street 2 | Intersection Street 1 | Intersection Street 2 | Address Type | City | Landmark | Facility Type | Status | Due Date | Resolution Description | Resolution Action Updated Date | Community Board | BBL | Borough | X Coordinate (State Plane) | Y Coordinate (State Plane) | Open Data Channel Type | Park Facility Name | Park Borough | Vehicle Type | Taxi Company Borough | Taxi Pick Up Location | Bridge Highway Name | Bridge Highway Direction | Road Ramp | Bridge Highway Segment | Latitude | Longitude | Location | Zip Codes | Community Districts | Borough Boundaries | City Council Districts | Police Precincts | Request Closing Time | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 59348005 | 2023-11-07 12:00:00 | NaT | DSNY | Department of Sanitation | Derelict Vehicles | Derelict Vehicles | Street | 11212.0 | 585 BRISTOL STREET | BRISTOL STREET | LOTT AVENUE | HEGEMAN AVENUE | None | None | ADDRESS | BROOKLYN | None | DSNY Garage | Open | NaT | If the abandoned vehicle meets the criteria to be classified as a derelict (i.e. junk) the Department of Sanitation (DSNY) will investigate and tag the vehicle within three business days. | 2023-11-07 12:00:00 | 16 BROOKLYN | 3.036240e+09 | BROOKLYN | 1009407.0 | 178525.0 | PHONE | Unspecified | BROOKLYN | None | None | None | None | None | None | None | 40.656651 | -73.909331 | (40.65665084455846, -73.90933142555113) | 17614.0 | 55.0 | 2.0 | 25.0 | 46.0 | NaT |
| 1 | 59348006 | 2023-11-07 12:00:00 | NaT | DSNY | Department of Sanitation | Derelict Vehicles | Derelict Vehicles | Street | 11229.0 | 2362 EAST 13 STREET | EAST 13 STREET | GRAVESEND NECK ROAD | AVENUE X | None | None | ADDRESS | BROOKLYN | None | None | Open | NaT | If the abandoned vehicle meets the criteria to be classified as a derelict (i.e. junk) the Department of Sanitation (DSNY) will investigate and tag the vehicle within three business days. | 2023-11-07 12:00:00 | 15 BROOKLYN | 3.073978e+09 | BROOKLYN | 996135.0 | 155337.0 | PHONE | Unspecified | BROOKLYN | None | None | None | None | None | None | None | 40.593032 | -73.957206 | (40.59303241455525, -73.9572059782219) | 13512.0 | 32.0 | 2.0 | 15.0 | 36.0 | NaT |
| 2 | 59352159 | 2023-11-07 12:00:00 | NaT | DSNY | Department of Sanitation | Derelict Vehicles | Derelict Vehicles | Street | 10040.0 | 34 HILLSIDE AVENUE | HILLSIDE AVENUE | BOGARDUS PLACE | ELLWOOD STREET | None | None | ADDRESS | NEW YORK | None | None | Open | NaT | If the abandoned vehicle meets the criteria to be classified as a derelict (i.e. junk) the Department of Sanitation (DSNY) will investigate and tag the vehicle within three business days. | 2023-11-07 12:00:00 | 12 MANHATTAN | 1.021710e+09 | MANHATTAN | 1003813.0 | 252041.0 | PHONE | Unspecified | MANHATTAN | None | None | None | None | None | None | None | 40.858447 | -73.929279 | (40.85844745053305, -73.92927888866892) | 13098.0 | 47.0 | 4.0 | 39.0 | 22.0 | NaT |
| 3 | 59346310 | 2023-11-07 02:21:07 | NaT | DOT | Department of Transportation | Street Condition | Pothole | None | 11105.0 | CRESCENT STREET | CRESCENT STREET | 23 AVENUE | DITMARS BOULEVARD | None | None | BLOCKFACE | QUEENS | None | N/A | Open | NaT | The Department of Transportation referred this complaint to the appropriate Maintenance Unit for repair. | 2023-11-07 02:21:08 | 01 QUEENS | NaN | QUEENS | NaN | NaN | UNKNOWN | Unspecified | QUEENS | None | None | None | None | None | None | None | NaN | NaN | None | NaN | NaN | NaN | NaN | NaN | NaT |
| 4 | 59353590 | 2023-11-07 02:07:50 | NaT | NYPD | New York City Police Department | Panhandling | N/A | Subway | NaN | None | None | None | None | None | None | None | None | None | None | In Progress | NaT | None | 2023-11-07 02:27:43 | Unspecified BROOKLYN | NaN | BROOKLYN | 1002852.0 | 182980.0 | MOBILE | Unspecified | BROOKLYN | None | None | None | 4 | None | None | Mezzanine | 40.668895 | -73.932944 | (40.66889502741585, -73.93294405917439) | 17615.0 | 17.0 | 2.0 | 48.0 | 44.0 | NaT |
| 5 | 59351961 | 2023-11-07 02:07:24 | NaT | NYPD | New York City Police Department | Blocked Driveway | No Access | Street/Sidewalk | 10466.0 | 637 EAST 230 STREET | EAST 230 STREET | CARPENTER AVENUE | LOWERRE PLACE | CARPENTER AVENUE | LOWERRE PLACE | ADDRESS | BRONX | EAST 230 STREET | None | In Progress | NaT | None | NaT | 12 BRONX | 2.048330e+09 | BRONX | 1022781.0 | 264296.0 | PHONE | Unspecified | BRONX | None | None | None | None | None | None | None | 40.892021 | -73.860638 | (40.892021197823865, -73.86063833365542) | 11275.0 | 29.0 | 5.0 | 2.0 | 30.0 | NaT |
| 6 | 59347939 | 2023-11-07 02:07:17 | NaT | NYPD | New York City Police Department | Illegal Parking | Posted Parking Sign Violation | Street/Sidewalk | 11214.0 | 80 BAY 50 STREET | BAY 50 STREET | WEST 16 STREET | PRIVATE CATANZARO SQUARE | WEST 16 STREET | PRIVATE CATANZARO SQUARE | ADDRESS | BROOKLYN | BAY 50 STREET | None | In Progress | NaT | None | 2023-11-07 02:20:53 | 13 BROOKLYN | 3.069170e+09 | BROOKLYN | 988275.0 | 153113.0 | MOBILE | Unspecified | BROOKLYN | None | None | None | None | None | None | None | 40.586935 | -73.985509 | (40.586935033893944, -73.98550860707033) | 17616.0 | 21.0 | 2.0 | 45.0 | 35.0 | NaT |
| 7 | 59343800 | 2023-11-07 02:07:04 | NaT | NYPD | New York City Police Department | Illegal Parking | Blocked Hydrant | Street/Sidewalk | 11235.0 | 2422 EAST 29 STREET | EAST 29 STREET | AVENUE X | AVENUE Y | AVENUE X | AVENUE Y | ADDRESS | BROOKLYN | EAST 29 STREET | None | In Progress | NaT | None | NaT | 15 BROOKLYN | 3.074221e+09 | BROOKLYN | 1000491.0 | 155341.0 | ONLINE | Unspecified | BROOKLYN | None | None | None | None | None | None | None | 40.593036 | -73.941521 | (40.59303648228758, -73.94152143256076) | 13826.0 | 32.0 | 2.0 | 15.0 | 36.0 | NaT |
| 8 | 59345174 | 2023-11-07 02:06:08 | NaT | NYPD | New York City Police Department | Noise - Residential | Loud Music/Party | Residential Building/House | 10031.0 | 514 WEST 136 STREET | WEST 136 STREET | AMSTERDAM AVENUE | BROADWAY | AMSTERDAM AVENUE | BROADWAY | ADDRESS | NEW YORK | WEST 136 STREET | None | In Progress | NaT | None | 2023-11-07 02:33:34 | 09 MANHATTAN | 1.019880e+09 | MANHATTAN | 997300.0 | 238040.0 | ONLINE | Unspecified | MANHATTAN | None | None | None | None | None | None | None | 40.820031 | -73.952851 | (40.82003081301524, -73.952850909447) | 12428.0 | 37.0 | 4.0 | 23.0 | 19.0 | NaT |
| 9 | 59349335 | 2023-11-07 02:04:27 | NaT | NYPD | New York City Police Department | Noise - Street/Sidewalk | Loud Music/Party | Street/Sidewalk | 10456.0 | 1164 SHERIDAN AVENUE | SHERIDAN AVENUE | MCCLELLAN STREET | EAST 167 STREET | MCCLELLAN STREET | EAST 167 STREET | ADDRESS | BRONX | SHERIDAN AVENUE | None | In Progress | NaT | None | NaT | 04 BRONX | 2.024560e+09 | BRONX | 1007148.0 | 242841.0 | ONLINE | Unspecified | BRONX | None | None | None | None | None | None | None | 40.833188 | -73.917254 | (40.83318814574537, -73.91725413909168) | 10934.0 | 50.0 | 5.0 | 42.0 | 27.0 | NaT |
| Unique Key | Created Date | Closed Date | Agency | Agency Name | Complaint Type | Descriptor | Location Type | Incident Zip | Incident Address | Street Name | Cross Street 1 | Cross Street 2 | Intersection Street 1 | Intersection Street 2 | Address Type | City | Landmark | Facility Type | Status | Due Date | Resolution Description | Resolution Action Updated Date | Community Board | BBL | Borough | X Coordinate (State Plane) | Y Coordinate (State Plane) | Open Data Channel Type | Park Facility Name | Park Borough | Vehicle Type | Taxi Company Borough | Taxi Pick Up Location | Bridge Highway Name | Bridge Highway Direction | Road Ramp | Bridge Highway Segment | Latitude | Longitude | Location | Zip Codes | Community Districts | Borough Boundaries | City Council Districts | Police Precincts | Request Closing Time | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 2156222 | 57023631 | 2023-03-12 18:53:18 | 2023-03-15 09:49:35 | HPD | Department of Housing Preservation and Development | DOOR/WINDOW | WINDOW FRAME | RESIDENTIAL BUILDING | 11374.0 | 63-25 SAUNDERS STREET | SAUNDERS STREET | None | None | None | None | ADDRESS | REGO PARK | None | None | Closed | NaT | The Department of Housing Preservation and Development inspected the following conditions. No violations were issued. The complaint has been closed. | 2023-03-15 00:00:00 | 06 QUEENS | 4.030800e+09 | QUEENS | 1021930.0 | 205172.0 | ONLINE | Unspecified | QUEENS | None | None | None | None | None | None | None | 40.729746 | -73.864048 | (40.729746170156055, -73.86404817380075) | 14785.0 | 40.0 | 3.0 | 28.0 | 70.0 | 2 days 14:56:17 |
| 2156223 | 57025426 | 2023-03-12 18:53:18 | 2023-03-15 09:49:35 | HPD | Department of Housing Preservation and Development | HEAT/HOT WATER | APARTMENT ONLY | RESIDENTIAL BUILDING | 11374.0 | 63-25 SAUNDERS STREET | SAUNDERS STREET | None | None | None | None | ADDRESS | REGO PARK | None | None | Closed | NaT | The Department of Housing Preservation and Development inspected the following conditions. No violations were issued. The complaint has been closed. | 2023-03-15 00:00:00 | 06 QUEENS | 4.030800e+09 | QUEENS | 1021930.0 | 205172.0 | ONLINE | Unspecified | QUEENS | None | None | None | None | None | None | None | 40.729746 | -73.864048 | (40.729746170156055, -73.86404817380075) | 14785.0 | 40.0 | 3.0 | 28.0 | 70.0 | 2 days 14:56:17 |
| 2156224 | 57027485 | 2023-03-12 18:53:00 | 2023-03-12 22:56:00 | DEP | Department of Environmental Protection | Water System | Hydrant Running (WC3) | None | 11104.0 | 50-44 42 STREET | 42 STREET | 50 AVE | 51 AVE | None | None | ADDRESS | SUNNYSIDE | None | None | Closed | NaT | The Department of Environmental Protection determined that this complaint is a duplicate of a previously filed complaint. The original complaint is being addressed. | 2023-03-12 22:56:00 | 02 QUEENS | 4.001810e+09 | QUEENS | 1005377.0 | 207746.0 | PHONE | Unspecified | QUEENS | None | None | None | None | None | None | None | 40.736866 | -73.923764 | (40.73686618999776, -73.92376432151327) | 16861.0 | 53.0 | 3.0 | 33.0 | 66.0 | 0 days 04:03:00 |
| 2156225 | 57022239 | 2023-03-12 18:53:00 | 2023-03-12 21:05:00 | DEP | Department of Environmental Protection | Water System | Hydrant Running (WC3) | None | 10003.0 | 16 WASHINGTON PLACE | WASHINGTON PLACE | MERCER ST | GREENE ST | None | None | ADDRESS | NEW YORK | None | None | Closed | NaT | The Department of Environmental Protection investigated this complaint and shut the running hydrant. | 2023-03-12 21:05:00 | 02 MANHATTAN | 1.005460e+09 | MANHATTAN | 985580.0 | 205131.0 | MOBILE | Unspecified | MANHATTAN | None | None | None | None | None | None | None | 40.729714 | -73.995201 | (40.729713791524404, -73.99520128002949) | 11724.0 | 57.0 | 4.0 | 32.0 | 3.0 | 0 days 02:12:00 |
| 2156226 | 57024468 | 2023-03-12 18:52:57 | 2023-03-13 12:05:42 | DCA | Department of Consumer Affairs | Consumer Complaint | Retail Store | Business | 11214.0 | 8222 18 AVENUE | 18 AVENUE | 82 STREET | 83 STREET | 82 STREET | 83 STREET | ADDRESS | BROOKLYN | 18 AVENUE | None | Closed | NaT | Unfortunately, the behavior that you complained about does not violate any law or rule. As a result, no city agency has the jurisdiction to act on the matter. | 2023-03-13 12:05:51 | 11 BROOKLYN | 3.063140e+09 | BROOKLYN | 984100.0 | 161140.0 | PHONE | Unspecified | BROOKLYN | None | None | None | None | None | None | None | 40.608968 | -74.000540 | (40.608968439263236, -74.00054023014746) | 17616.0 | 1.0 | 2.0 | 44.0 | 37.0 | 0 days 17:12:45 |
| 2156227 | 57026551 | 2023-03-12 18:52:44 | 2023-03-13 03:39:42 | NYPD | New York City Police Department | Illegal Parking | Blocked Hydrant | Street/Sidewalk | 11385.0 | 78-36 79 PLACE | 79 PLACE | 78 AVENUE | MYRTLE AVENUE | 78 AVENUE | MYRTLE AVENUE | ADDRESS | RIDGEWOOD | 79 PLACE | None | Closed | NaT | The Police Department responded and upon arrival those responsible for the condition were gone. | 2023-03-13 03:39:47 | 05 QUEENS | 4.038280e+09 | QUEENS | 1020202.0 | 195810.0 | MOBILE | Unspecified | QUEENS | None | None | None | None | None | None | None | 40.704057 | -73.870333 | (40.70405694530993, -73.8703328992707) | 15310.0 | 54.0 | 3.0 | 34.0 | 62.0 | 0 days 08:46:58 |
| 2156228 | 57026829 | 2023-03-12 18:52:30 | 2023-03-12 22:53:52 | NYPD | New York City Police Department | Noise - Residential | Loud Music/Party | Residential Building/House | 11102.0 | 30-07 NEWTOWN AVENUE | NEWTOWN AVENUE | 30 STREET | 31 STREET | 30 STREET | 31 STREET | ADDRESS | ASTORIA | NEWTOWN AVENUE | None | Closed | NaT | The Police Department responded to the complaint and with the information available observed no evidence of the violation at that time. | 2023-03-12 22:53:57 | 01 QUEENS | 4.005988e+09 | QUEENS | 1006094.0 | 219178.0 | ONLINE | Unspecified | QUEENS | None | None | None | None | None | None | None | 40.768242 | -73.921140 | (40.76824237904741, -73.92113992853868) | 16859.0 | 39.0 | 3.0 | 4.0 | 72.0 | 0 days 04:01:22 |
| 2156229 | 57022226 | 2023-03-12 18:52:08 | 2023-03-12 21:10:51 | NYPD | New York City Police Department | Illegal Parking | Blocked Hydrant | Street/Sidewalk | 11218.0 | 390 OCEAN PARKWAY | OCEAN PARKWAY | AVENUE C | CORTELYOU ROAD | AVENUE C | CORTELYOU ROAD | ADDRESS | BROOKLYN | OCEAN PARKWAY | None | Closed | NaT | The Police Department responded and upon arrival those responsible for the condition were gone. | 2023-03-12 21:10:54 | 12 BROOKLYN | 3.053740e+09 | BROOKLYN | 991638.0 | 172258.0 | MOBILE | Unspecified | BROOKLYN | None | None | None | None | None | None | None | 40.639482 | -73.973380 | (40.63948194101334, -73.97337969679486) | 17620.0 | 2.0 | 2.0 | 27.0 | 39.0 | 0 days 02:18:43 |
| 2156230 | 57026255 | 2023-03-12 18:52:00 | 2023-03-14 08:35:54 | HPD | Department of Housing Preservation and Development | HEAT/HOT WATER | ENTIRE BUILDING | RESIDENTIAL BUILDING | 11106.0 | 31-35 CRESCENT STREET | CRESCENT STREET | None | None | None | None | ADDRESS | ASTORIA | None | None | Closed | NaT | The complaint you filed is a duplicate of a condition already reported by another tenant for a building-wide condition. The original complaint is still open. HPD may attempt to contact you to verify the correction of the condition or may conduct an inspection of your unit if the original complainant is not available for verification. | 2023-03-14 00:00:00 | 01 QUEENS | 4.005790e+09 | QUEENS | 1004421.0 | 217880.0 | MOBILE | Unspecified | QUEENS | None | None | None | None | None | None | None | 40.764684 | -73.927184 | (40.76468368198577, -73.92718359841146) | 16863.0 | 39.0 | 3.0 | 4.0 | 72.0 | 1 days 13:43:54 |
| 2156231 | 57025202 | 2023-03-12 18:51:46 | 2023-03-12 18:59:55 | NYPD | New York City Police Department | Noise - Residential | Banging/Pounding | Residential Building/House | 10468.0 | 233 LANDING ROAD | LANDING ROAD | CEDAR AVENUE | MAJOR DEEGAN EXPRESSWAY | CEDAR AVENUE | MAJOR DEEGAN EXPRESSWAY | ADDRESS | BRONX | LANDING ROAD | None | Closed | NaT | The Police Department reviewed your complaint and provided additional information below. | 2023-03-12 19:00:00 | 07 BRONX | 2.032368e+09 | BRONX | 1008841.0 | 253488.0 | ONLINE | Unspecified | BRONX | None | None | None | None | None | None | None | 40.862406 | -73.911097 | (40.86240645467484, -73.9110{\n error : true,\n message : Internal error,\n status : 500\n} | NaN | NaN | NaN | NaN | NaN | 0 days 00:08:09 |